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TITLE OF THE INVENTION 

MELANIN CONCENTRATING HORMONE RECEPTOR CHIMERIC AND 
FUSION PROTEINS 

5 CROSS-REFERENCE TO RELATED APPLICATIONS 

The present application claims priority to provisional application U.S. 
Serial No. 60/189,698, filed March 15, 2000, hereby incorporated by reference herein. 

BACKGROUND OF THE INVENTION 

10 The references cited herein are not admitted to be prior art to the 

claimed invention. 

Neuropeptides present in the hypothalamus play a major role in 
mediating the control of body weight. (Flier et al, 1998. Cell, 92, 437-440.) Melanin- 
concentrating hormone (MCH) is a cyclic 19-amino acid neuropeptide synthesized as 

15 part of a larger pre-prohormone precursor in the hypothalamus which also encodes 
neuropeptides NEI and NGE. (Nahon et al, 1990. Mol Endocrinol 4, 632-637.) 
MCH was first identified in salmon pituitary, and in fish MCH affects melanin 
aggregation thus affecting skin pigmentation. In trout and in eels MCH has also been 
shown to be involved in stress induced or CRF-stimulated ACTH release. (Kawauchi 

20 et al, 1983. Nature 305, 321-323.) 

In humans two genes encoding MCH have been identified that are 
expressed in the brain. (Breton et al, 1993. Mol Brain Res. 18, 297-310.) In 
mammals MCH has been localized primarily to neuronal cell bodies of the 
hypothalamus which are implicated in the control of food intake, including perikarya 

25 of the lateral hypothalamus and zona inertia. (Knigge et al } 1996. Peptides 17, 1063- 
1073.) 

Pharmacological and genetic evidence suggest that the primary mode 
of MCH action is to promote feeding (orexigenic). MCH mRNA is up regulated in 
fasted mice and rats and in the ob/ob mouse. (Qu et al, 1996. Nature 380, 243-247.) 
30 Injection of MCH centrally (ICV) stimulates food intake and MCH antagonizes the 
hypophagic effects seen with cc-melanocyte stimulating hormone (ocMSH). (Qu et 
al, 1996. Nature 380, 243-247.) MCH-deficient mice are lean, hypophagic, and have 
increased metabolic rate. (Shimada et al, 1998. Nature 396, 670-673.) 

MCH action is not limited to modulation of food intake as effects on 
35 the hypothalamic-pituitary-axis have been reported. (Nahon 1994. Critical Rev. in 
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Neurobiol. 8, 221-262.) MCH may be involved in the body response to stress as 
MCH can modulate the stress-induced release of CRF from the hypothalamus and 
ACTH from the pituitary. In addition, MCH neuronal systems may be involved in 
reproductive or maternal function. 
5 Several references describe a receptor that is indicated to bind MCH. 

(Chambers et al, 1999. Nature 400, 261-265; Saito et al, 1999. Nature 400, 265-269; 
BSchnerefa/., 1999. FEBS Letters 457:522-524; Shimomuraef al, 1999. 
Biochemical and Biophysical Research Communications 261, 622-626; and Lembo et 
al, 1999. Nat. Cell Biol 1, 267-271.) 

10 

SUMMARY OF THE INVENTION 

The present invention features melanin concentrating hormone receptor 
(MCH-R) chimeric and fusion proteins. MCH-R chimeric proteins comprise an 
MCH-R polypeptide region made up of at least two or more polypeptide regions 
1 5 characteristic of MCH-R found in different species. MCH-R fusion proteins comprise 
an MCH-R polypeptide region and a fluorescent protein region. 

An MCH-R polypeptide region provides a functional G-protein 

coupled receptor region able to bind MCH and transduce an intracellular signal. 

Examples of MCH-R polypeptide regions include naturally occurring MCH-R, 
20 chimeric MCH-R containing two or more regions from naturally occurring MCH-R, 

and functional derivatives thereof. 

Reference to the terms "characteristic" and "derivatives thereof 

describe a relationship to a reference sequence. In both cases, there is at least about 

75% sequence similarity to the reference sequence. 
2 5 Thus, a first aspect of the present invention describes a fusion protein 

comprising (a) an MCH-R polypeptide region and (b) a fluorescent polypeptide 

region. The fluorescent polypeptide region is joined directly, or though a polypeptide 

linker, to the carboxy side of the MCH-R polypeptide region. 

Another aspect of the present invention describes an MCH-R chimeric 
30 protein. The protein comprises: (a) an MCH-R binding region characteristic of a 

human MCH-R, (b) a transmembrane domain characteristic of a non-human MCH-R, 

and (c) an intracellular domain characteristic of a non-human MCH-R. 

Another aspect of the present invention describes a nucleic acid 

encoding for an MCH-R fusion protein or an MCH-R chimeric protein described 
35 herein. Such nucleic acid comprises either a contiguous nucleotide sequence that 
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codes for the protein or a sequence that is processed by a host cell to produce a 
contiguous nucleotide sequence encoding for the protein. Processing of a nucleic acid 
sequence to produce a contiguous nucleotide sequence encoding for a protein can 
occur by the splicing together of exons resulting in intron removal. 
5 Another aspect of the present invention describes an expression vector 

comprising a nucleic acid encoding for an MCH-R fusion protein or an MCH-R 
chimeric protein described herein. 

Another aspect of the present invention describes a recombinant cell 
comprising nucleic acid encoding for an MCH-R fusion protein or an MCH-R 
10 chimeric protein described herein. The nucleic acid may be part of the host genome 
or may exist independently of the host genome. 

Another aspect of the present invention describes a non-human 
transgenic animal comprising nucleic acid encoding for an MCH-R fusion protein or 
an MCH-R chimeric protein described herein, 
j 5 Another aspect of the present invention describes a method for 

assaying for MCH-R active compounds by measuring the effect of a test preparation 
on one or more MCH-R activities. The method is performed using either an MCH-R 
fusion protein or an MCH-R chimeric protein described herein. 

Other features and advantages of the present invention are apparent 
20 from the additional descriptions provided herein including the different examples. 
The provided examples illustrate different components and methodology useful in 
practicing the present invention. The examples do not limit the claimed invention. 
Based on the present disclosure the skilled artisan can identify and employ other 
components and methodology useful for practicing the present invention. 

25 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 illustrates aequorin assay results comparing a mouse MCH-R 
fusion with a human wild type MCH-R and a CMV-EGFP control. 

Figure 2 illustrates a cAMP flashplate assay of CHO cell clones stably 
30 expressing mMCH- 1R-EGFP. Cells from individual clones were dissociated in 
enzyme free media and stimulated for 15 minutes at 37°C with human MCH at the 
indicated concentrations in the presence of 10 uM forskolin. Cells were then lysed 
and assayed for bound [ 125 I]cAMP. Mouse MCH-1R-EGFP clones exhibited EC50 
values (0.111 1, 0.1255, 0.1291, or 0.2304 nM) indistinguishable from that of a CHO 
35 cell clone expressing the wild-type human short isoform of MCH-1R ( 0.1282 nM). 
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Figure 3 illustrates a cAMP flashplate assay of CHO cell clones stably 
expressing human short/mouse species chimeric MCH-1R-EGFP. Cells from 
individual clones were dissociated in enzyme free media and stimulated for 15 
minutes at 37°C with human MCH at the indicated concentrations in the presence of 
5 10 MM forskolin. Cells were then lysed and assayed for bound [ ,2S I]cAMP. Human 
short/mouse species chimeric MCH-1R-EGFP clones exhibited EC50 values (0.0366, 
0.0462, 0.21 17, or 0.2499 nM) indistinguishable from that of a CHO cell clone 
expressing the wild-type human short isof orm of MCH-1R (0. 1 137 nM). 

10 DETAILED DESCRIPTION OF THE INVENTION 

The present invention features MCH-R chimeric and fusion proteins. 
Such proteins have a variety of different uses including being used as a research tool 
to study MCH-R function and dynamics, and being used to screen for MCH-R 
agonists and antagonists. 

15 The MCH-R provides a target to achieve different beneficial effects in 

a patient. Preferably, MCH-R activity is modulated to achieve one or more of the 
following: weight loss, weight gain, treat cancer {e.g., colon or breast), reduce pain, 
treat diabetes, reduce stress, or teat sexual dysfunction. 

Modulation of MCH-R activity can be achieved by evoking a response 

20 at the MCH receptor or by altering a response evoked by an MCH receptor agonist or 
antagonist. Compounds modulating MCH-R receptor activity include agonists, 
antagonists, and allosteric modulators. Generally, MCH-R antagonists and allosteric 
modulators negatively affecting activity will be used to achieve weight loss, treat 
cancer (e.g., colon or breast), reduce pain, reduce stress, or teat sexual dysfunction; 

25 and MCH-R agonists and allosteric modulators positively affecting activity will be 
used to produce a weight gain. 

Preferably, MCH-R activity is modulated to achieve a weight loss or to 
treat diabetes in a patient. Diabetes mellitus can be treated by modulating MCH-R 
activity to achieve, for example, one or both of the following: enhancing glucose 

30 tolerance or decreasing insulin resistance. 

Excessive body weight is a contributing factor to different diseases, 
including hypertension, diabetes, dyslipidemias, cardiovascular disease, gall stones, 
osteoarthritis, and certain forms of cancers. Bringing about a weight loss can be used, 
for example, to reduce the likelihood of such diseases and as part of a treatment for 
35 su ch diseases. Weight reduction can be achieved by modulating MCH-R activity to 
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obtain, for example, one or more of the following effects: reducing appetite, 
increasing metabolic rate, reducing fat intake, or reducing carbohydrate craving. 

Increasing body weight is particularly useful for a patient having a 
disease or disorder, or under going a treatment, accompanied by weight loss. 
5 Examples of diseases or disorders accompanied by weight loss include anorexia, 
AIDS, wasting, cachexia, and frail elderly. Examples of treatments accompanied by 
weight loss include chemotherapy and radiation therapy. 

MPH-R Chimeric Proteins 

10 ~ MCH-R chimeric proteins contain an MCH-R polypeptide region made 

up by at least two or more polypeptide regions characteristic of MCH-R found in 
different species. The different polypeptide regions that are present provide for an N- 
terminal extracellular domain; a transmembrane domain made up of transmembrane 
regions, extracellular loop regions, and intracellular loop regions; and an intracellular 

15 carboxy terminus domain. Examples of MCH-R amino acid sequences include the 
following: SEQ. ID. NO. 1 (human MCH1R long form), SEQ. ID. NO. 2 (human 
MCH1R short form), and SEQ, ID. NO. 3 (mouse MCH1R). 

Preferably, the MCH-R chimeric protein comprises an MCH-R binding 
region characteristic of a human MCH-R along with transmembrane and intracellular 

20 domains characteristic of a non-human MCH-R. There are substantial amino acid 

differences between the N-terminus of the MCH-R present in humans and that present 
in other species such as mice. Such differences could result in, for example, the 
mouse MCH-R having different intrinsic properties and responsiveness to agonists 
and/or antagonists than the human MCH-R. The presence of a human MCH-R 

25 binding region provides for a "humanized" MCH-R chimeric receptor. 

The transmembrane and intracellular domains characteristic of a non- 
human MCH-R can be used in conjunction with a non-human host to provide a more 
naturally occurring environment for these regions. For example, an MCH-R chimeric 
having mouse transmembrane and intracellular domains are preferably used in murine 

30 cells lines or in transgenic mice. 

MCH-R chimeric proteins may contain regions other than extracellular, 
transmembrane, and intracellular domains that do not substantially decrease the 
activity of the protein. Preferably, additional regions do not cause a decrease of more 
than about 25% of MCH-R activity as measured using one or more of the assays 
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described in the examples provided below. Examples of additional regions that may 
be present include fluorescent protein regions and linker regions. 

In an embodiment of the present invention, the MCH-R chimeric 
protein comprises: (a) an MCH binding region characteristic of a first species and (b) 
5 a transmembrane and intracellular domain region characteristic of a second species 
joined directly, or though a linker, to the carboxy side of the MCH binding region. 
Preferably, the protein comprises, consists, or consists essentially of an MCH-R 
polypeptide having a sequence similarity of at least about 75%, at least 85%, or at 
least 95% with either SEQ. ID. NO. 4 (human short form/mouse species chimeric 
10 MCH1R) or SEQ. ED. NO. 5 (human long form/mouse species chimeric). Even more 
preferably, the protein comprises, consists essentially of, or consists of, SEQ. ID. NO. 
4 or SEQ. ID. NO. 5. 

Sequence similarity for polypeptides can be determined by BLAST. 
(Altschul et al, 1997. Nucleic Acids Res. 25, 3389-3402, hereby incorporated by 
15 reference herein.) In an embodiment of the present invention, sequence similarity is 
determined using tBLASTn search program with the following parameters: 
MATRK:BLOSUM62, PER RESIDUE GAP COST: 1 1, and Lambda ratio: 1 . 

Differences in naturally occurring amino acids are due to different R 
groups. An R group effects different properties of the amino acid such as physical 
20 size, charge, and hydrophobicity. Amino acids can be divided into different groups as 
follows: neutral and hydrophobic (alanine, valine, leucine, isoleucine, proline, 
tryptophan, phenylalanine, and methionine); neutral and polar (glycine, serine, 
threonine, tyrosine, cysteine, asparagine, and glutamine); basic (lysine, arginine, and 
histidine); and acidic (aspartic acid and glutamic acid). 
25 Generally, in substituting different amino acids it is preferable to 

exchange amino acids having similar properties. Substituting different amino acids 
within a particular group, such as substituting valine for leucine, arginine for lysine, 
and asparagine for glutamine are good candidates for not causing a change in 
polypeptide functioning. 
30 Changes outside of different amino acids groups can also be made. 

Preferably, such changes are made taking into account the position of the amino acid 
to be substituted in the polypeptide. For example, arginine can substitute more freely 
for nonpolor amino acids in the interior of a polypeptide then glutamate because of its 
long aliphatic side chain. (See, Ausubel, Current Protocols in Molecular Biology, 
35 John Wiley, 1987-1998, Supplement 33 Appendix 1C.) 
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MHH-R Fusion Proteins 

MCH-R fusion proteins contain an MCH-R polypeptide region and a 
fluorescent protein region either directly joined together or joined together through a 
5 linker. These regions provide MCH-R activity and a marker for evaluating MCH-R 
dynamics. 

An MCH-R polypeptide region provides functional MCH-R activity 
and includes naturally occurring MCH-R, chimeric MCH-R, and derivatives thereof. 
Preferred derivatives thereof have a sequence similarity of at least about 75%, at least 

10 about 85%, or at least about 95% to a naturally occurring MCH-R or a chimeric 
MCH-R described herein. 

A fluorescent protein region contains a chromophore that fluoresces. 
Preferably, the fluorescent protein region is the green fluorescent protein of the 
jellyfish Aequorea victoria or a derivative thereof. Preferred derivatives have a 

15 sequence similarity of at least about 75%, at least about 85%, or at least about 95% to 
the Aequorea victoria green fluorescent protein (GFP). The Aequorea victoria green 
fluorescent protein and examples of derivatives thereof are described by Cormack et 
al, 1996. Gene 17, 33-38; Yang et al, 1996. Nucleic Acids Research24, 4592-4593; 
Tsien et al, U.S. Patent No. 5,625,048; Tsien et al, U.S. Patent No. 5,777,079; and 

20 Cormack et al., U.S. Patent No. 5,804,387 (each of which are hereby incorporated by 

reference herein). 

In different embodiments the MCH-R polypeptide region comprises, 
consists essentially of, or consists of, a sequence selected from the group consisting 
of: SEQ. ID. NO. 1, SEQ. ID. NO. 2, SEQ. ID. NO. 3, SEQ. ID. NO. 4, and SEQ. ID. 
25 NO. 5; and the fluorescent polypeptide region comprises, consists essentially of, or 
consists of, an amino acid sequence selected from the group consisting of SEQ. ID. 
NO. 6 (GFP), SEQ. ID. NO. 7 (EGFP), SEQ. ID. NO. 8 (Emerald), SEQ. ID. NO. 9 
(Topaz), and SEQ. ID. NO. 10 (Wlb). EGFP, Emerald, Topaz, and Wlb are 
derivatives of GFP. 

30 The optionally present linker is a polypeptide region that is preferably 

from 1 to about 100 amino acids in length. In different embodiments the linker is up 
to 75, 50 or 25 amino acids in length. 

Preferably, the MCH-R fusion protein comprises, consists essentially 
of, or consists of, the MCH-R polypeptide region and the fluorescent polypeptide 

35 region. More preferably, the protein comprises, consists essentially of, or consists of, 
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an amino acid sequence selected from the group consisting of: SEQ. ID. NO. 1 1 
(mouse MCHlR-linker-EGFP), SEQ. ID. NO. 12 (mouse MCH1R/EGFP direct 
fusion), SEQ. ID. NO. 13 (human short form/mouse species chimeric MCHlR-linker- 
EGFP), or SEQ. ID. NO. 14 (human long form/mouse species chimeric MCH1R- 
5 linker-EGFP). 

•N/IPH-R Chimeric and Fusion Pr oteins Nucleic Acid and Expression 

MCH-R chimeric and fusion proteins can be produced using 
techniques well known in the art. Preferably, such proteins are produced by 
10 recombinant expression inside a host cell by way of an expression vector or by way of 
nucleic acid integrated into the host genome. Examples of nucleic acid sequences 
encoding for MCH-R polypeptide regions, fluorescent protein regions, MCH-R 
chimeric proteins, and MCH-R fusion proteins are provided for by SEQ. ID. NOs. 15- 
29 (see Example 1, infra). 
1 5 Starting with a particular amino acid sequence and the known 

degeneracy of the genetic code, a large number of different encoding nucleic acid 
sequences can be obtained. The degeneracy of the genetic code arises because almost 
all amino acids are encoded for by different combinations of nucleotide triplets or 
codons. The translation of a particular codon into a particular amino acid is well 
20 known in the art (see, e.g., Lewin GENES IV, p. 119, Oxford University Press, 1990). 
Amino acids are encoded for by codons as follows: 
A=Ala=Alanine: codons GCA, GCC, GCG, GCU 
C=Cys=Cysteine: codons UGC, UGU 
D=Asp=Aspartic acid: codons GAC, GAU 
25 E=Glu=Glutamic acid: codons GAA, GAG 
F=Phe=Phenylalanine: codons UUC, UUU 
G=Gly=Glycine: codons GGA, GGC, GGG, GGU 
H=His=Histidine: codons CAC, CAU 
I=He=Isoleucine: codons AUA, AUC, AUU 
30 K=Lys=Lysine: codons AAA, AAG 

L=Leu=Leucine: codons UUA, UUG, CUA, CUC, CUG, CUU 
M=Met=Methionine: codon AUG 
N=Asn=Asparagine: codons AAC, AAU 
p=Pro=Proline: codons CCA, CCC, CCG, CCU 
35 Q=Gln=Glutamine: codons CAA, CAG 
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R=A rg=Arginine: codons AGA, AGG, CGA, CGC, CGG, CGU 
S=Ser=Serine: codons AGC, AGU, UCA, UCC, UCG, UCU 
T=Thr=Threonine: codons ACA, ACC, ACG, ACU 
V=Val=Valine: codons GUA, GUC, GUG, GUU 
5 w=Trp=Tryptophan: codon UGG 
Y=Tyr=Tyrosine: codons UAC, UAU 

Examples of techniques for introducing nucleic acid into a cell and 
expressing the nucleic acid to produce protein are provided in references such as 
Ausubel, Current Protocols in Molecular Biology, John Wiley, 1987-1998, and 
10 Sambrook, et al, in Molecular Cloning, A Laboratory Manual, 2 nd Edition, Cold 
Spring Harbor Laboratory Press, 1989. 

An expression vector contains recombinant nucleic acid encoding for a 
polypeptide along with regulatory elements for proper transcription and processing. 
The recombinant nucleic acid contains two or more nucleic acid regions not naturally 
15 associated with each other. Exogenous regulatory elements such as an exogenous 
promoter can be useful for expressing recombinant nucleic acid in a particular host. 
Examples of expression vectors are cloning vectors, modified cloning vectors, 
specifically designed plasmids, and viruses. 

Generally, the regulatory elements that are present in an expression 
20 vector include a transcriptional promoter, a ribosome binding site, a terminator, and 
an optionally present operator. Another preferred element is a polyadenylation signal 
providing for processing in eukaryotic cells. Preferably, an expression vector also 
contains an origin of replication for autonomous replication in a host cell, a selectable 
marker, a limited number of useful restriction enzyme sites, and a potential for high 
25 copy number. 

Expression vectors providing suitable levels of polypeptide expression 
in different hosts are well known in the art. Mammalian expression vectors well 
known in the art include pcDNA3 (Invitrogen), pMClneo (Stratagene), pXTl 
(Stratagene), pSG5 (Stratagene), EBO-pSV2-neo (ATCC 37593), pBPV-l(8-2) 
30 (ATCC 37110), pdBPV-MMTneo(342-12) (ATCC 37224), pRSVgpt (ATCC 37199), 
pRSVneo (ATCC 37198), pSV2-dhfr (ATCC 37146), pUCTag (ATCC 37460), pCI- 
neo (Promega) and .lambda.ZD35 (ATCC 37565). Bacterial expression vectors well 
known in the art include pETlla (Novagen), lambda gtll (Invitrogen), pcDNAJJ 
(Invitrogen), and pKK223-3 (Pharmacia). Fungal cell expression vectors well known 



-9- 



WO 01/68706 



PCT/USOl/08071 



in the art include pYES2 (Invitrogen) and Pichia expression vector (Invitrogen). 
Insect cell expression vectors well known in the art include Blue Bac M (Invitrogen). 

Recombinant host cells may be prokaryotic or eukaryotic. Examples 
of recombinant host cells include the following: bacteria such as E. coli; fungal cells 
5 such as yeast; mammalian cells such as human, bovine, porcine, monkey, hampster, 
and rodent; and insect cells such as Drosophila and silkwoim derived cell lines. 
Commercially available mammalian cell lines include L cells L-M(TK.sup.-) (ATCC 
CCL 1.3), L cells L-M (ATCC CCL 1.2), 293 (ATCC CRL 1573), Raji (ATCC CCL 
86), CV-1 (ATCC CCL70),COS-1 (ATCC CRL 1650), COS-7 (ATCC CRL 1651), 
10 CHO-K1 (ATCC CCL 61), 3T3 (ATCC CCL 92), NIH/3T3 (ATCC CRL 1658), 
HeLa (ATCC CCL 2), C127I (ATCC CRL 1616), BS-C-1 (ATCC CCL 26) and 
MRC-5(ATCC CCL 171). 

To enhance expression in a particular host it may be useful to modify 
the sequence to take into account codon usage of the host. Codon usage of different 
15 organisms are well known in the art. (See, Ausubel, Current Protocols in Molecular 
Biology, John Wiley, 1987-1998, Supplement 33 Appendix 1C.) 

Expression vectors may be introduced into host cells using standard 
techniques. Examples of such techniques include transformation, transfection, 
lipofection, protoplast fusion, and electroporation. 
20 ~ Nucleic acid encoding for a polypeptide can be expressed in a cell 

without the use of an expression vector employing, for example, synthetic mRNA or 
native mRNA. Additionally, mRNA can be translated in various cell-free systems 
such as wheat germ extracts and reticulocyte extracts, as well as in cell based systems, 
such as frog oocytes. Inuoduction of mRNA into cell based systems can be achieved, 
25 for example, by microinjection. 

Techniques for producing transgenic animals are well known in the art. 
Examples of such techniques are provided for by Teratocarcinomas and embryonic 
stem cells: a practical approach. Ed. By E. J. Robertson, IRL Press Limited, Oxford, 
England (1987); and Gene Targeting: a practical approach. Ed. By A. L. Joyner, 
30 Oxford University Press Inc. New York, NY (1993). 

O-Protein Coupled Receptor Assays 

MCH-R is G-protein coupled receptor. Techniques for measuring 
different G-protein activities, such as Gi/o, Gs, and Gq are well known in the art. 
35 MCH-R activity is preferably assayed for by measuring either Gi/o or Gq. 
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Gi/o and Gs activity can be measured using techniques such as a 
melonaphore assay, measuring cAMP production, measuring inhibition of cAMP 
accumulation, and measuring binding of 35S-GTP. cAMP can be measured using 
different techniques such as radioimmunoassay and indirectly by cAMP responsive 

5 gene reporter proteins. 

Gq activity can be measured using techniques such as those measuring 
intracellular Ca 2+ . Examples of techniques well known in the art that can be 
employed to measure Ca 2+ include the use of dyes such as Fura-2 and the use of Ca 2+ - 
bioluminescent sensitive reporter proteins such as aequorin. An example of a cell line 

10 employing aequorin to measure G-protein activity is HEK293/aeql7. (Button et 
al, 1993. Cell Calcium 14, 663-671, and Feighner et al., 1999. Science 284, 2184- 
2188, both of which are hereby incorporated by reference herein.) 

Functional assays can be performed using individual compounds or 
preparations containing different compounds. A preparation containing different 

15 compounds where one or more compounds affect MCH-R chimeric or fusion protein 
activity can be divided into smaller groups of compounds to identify the compound(s) 
affecting MCH-R chimeric or fusion protein activity. In an embodiment of the 
present invention a test preparation containing at least 10 compounds is used in a 
functional assay. 

20 Functional assays can be performed using recombinantly produced 

MCH-R chimeric or fusion protein present in different environments. Such 
environments include, for example, cell extracts and purified cell extracts containing 
the MCH-R chimeric or fusion protein expressed from recombinant nucleic acid and 
an appropriate membrane for the polypeptide; and the use of a purified MCH-R 

25 chimeric or fusion protein produced by recombinant means that is introduced into a 
different environment suitable for measuring G-protein activity. 

Fluorescent Protein Assays 

Fluorescent protein joined to an MCH receptor can be employed to 
30 study different aspects of receptor dynamics including receptor sequestration, receptor 
densitization, and receptor localization. The fluorescent protein can be used in in 

vitro or in vivo systems. 

In vitro applications of fluorescent proteins can be performed using 
techniques well known in the art. Examples of such techniques are provided by Barak 
35 et al., 1997. Mol Pharm. 5, 177-184; Tarasova et al., 1997. J. Biol. Chem. 272, 
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148 17-14824; Lin et al, 1998. Mol. Cell. Endo. 146, 27-37; Tarasova et al., 1998. J. 
Biol. Chem. 273, 15883-15886; Kallal et al, 1998. J. Biol. Chem. 273, 322-328; 
Groake et al, 1999. J. Biol. Chem. 274, 23263-23269; Doherty et al, 1999. 
Biochem. J. 341, 415-422; Brock et al, 1999. Proc. Natl. Acad. Sci. VSA 96, 10123- 
5 10128; Cornea et al., 1999. Endocrinology 140, 4272-4280; and Lembo et al, 1999. 
Nat. Cell Biol 1, 267-271 (these references are not admitted to be prior art to the 

claimed invention). 

In vivo applications of fluorescent proteins can be performed using 
techniques well known in the art. Examples of such techniques are provided by 
10 Mombaerts et al., 1996. Cell 87, 675-686; Rodriquez et al., 1999. Cell 97, 199-208; 
Spergel et al, 1999. /. Neurosci. 1, 2037-2050; and Zuo et al, 1999. Proc. Natl. 
Acad. Sci. USA 96, 14100-14105 (these references are not admitted to be prior art to 
the claimed invention). 

15 EXAMPLES 

Examples are provided below to further illustrate different features and 
advantages of the present invention. The examples also illustrate useful methodology 
for practicing the invention. These examples do not limit the claimed invention. 

20 Ex am ple 1: 

Amino acid and nucleic acid sequence information for SEQ. DD. NOs. 
1-29 are provided below. SEQ. ID. NOs. 1-29 include examples of polypeptide and 
encoding nucleic acid sequences for MCH-R polypeptide regions, fluorescent 
polypeptide regions, fusion proteins and chimeric proteins. In some cases the 
25 encoding nucleic acid is shown with additional nucleic acid upsteam or downstream 
from an open reading frame. 

SEQ. ID. NO. 1: Human long form MCHIR 

MSVGAMKKGVGRAVGLGGGSGCQATEEDPLPNCGACAPGQGGRRWRLPQP 
30 AWVEGSSARLWEQATGTGWMDLEASLLPTGPNASNTSDGPDNLTSAGSPPR 

TGSISYINIIMPSWGTICIXG^ 
iXFLLGMPFMHQIJvlGNGVWHFGET 

yLATVIMSSTKFRKPSVATLVTC^^ 
RIPNPDTDLYWFTLYQFFLAFA^^ 



-12- 



WO 01/68706 



PCT/USOl/08071 



TKRVTRTAIAICLVFFVCWAPYYVLQLTQLSISRPTLTFVYLYNAAISLGYANS 
CLNPFVYWLCETFRKRLVLSVKPAAQGQLRAVSNAQTADEERTESKGT 

SEQ. ID. NO. 2. Human short form MCH1R 

5 MDLEASIXPTGPNASOTSDGPDNLTSAGSPPRTGSIS\TNIIMPSVFGTICLLGnG 

NSTWAVVKKSKIJTWCNNVPDIFRNI^VVDllJ^ 

FGETMCTDTAMDANSQFTSTmTAMAroRYLATVHPISSTKF^SVATLVI 
GLLW ALSFISrrPVWLY ARLIPFPGG AVGCGIRIJ'NPDTDLYWFTLYQFFLAFA 
LPFVVlTAAYVRILQRMrSSVAPASQRSIRIJRTKRVTRTAlAICLVFFVCWAPY 
10 YVLQLTQLSISRPTLTFVYLYNAAISLGYANSCIJ^FVYIVLCETFRKmVLSV 

KPAAQGQLRAVSNAQTADEERTESKGT 
SEQ. ID. NO. 3: Mouse MCH1R 

^^DLQASliSTGPNASmSDGQDNFII.AGPPPRTRSVS\-^^OT^ffSVFGTICLLGI 

1 5 VGNSTVIFA WKKSKLHWCSNVPDIFIINLS VVDLLFLLGMPFMfflQLMGNGV 
WHFGETMCTLITAMDANSQFTSTYrLTAMAIDRYLATVHPISSTKFRKPSMAT 
LVICLLWALSHSITPVWLYARLIPFPGGAVGCGIRLPNPDTDLYWFTLYQFFLA 
FALPFVVrrAAYVKILQRMTSSVAPASQRSIRLRTKRVTRTAIAICLVFFVCWA 
PYYVLQLTQ^ISRPTLTFVYLYNAAISLGYANSCLNPF^YTVLCETFRKRLVLS 

20 VKPAAQGQLRTVSNAQTADEERTESKGT 

SEQ. n). NO. 4: Human short form/mouse species chimeric MCH1R 

MDLEASIIPTGPNASNTSDGPD^^TSAGSPPRTGSISYI^m^lPSVFGTICLLG^G 

NSTVFAVVKKSKLHWC^^^PDIFIINLSVVDI1J^GMPFMIHQLMGNGVWH 
25 FGETMCTLITAMD ANS QFTS TYILTAMAJDR YLATVHPIS STKFRKPSMATLVI 
CIXWAI^HSITPVWLYARIIPFPGGAVGCG 

LPFVVrTAAYVKILQRMTSSVAPASQRSIRLRTKRVTRTAlAICLVFFVCWAPY 
YVLQLTQLSISRPTLTFVYLYNAAISLGYANSCL^FVYIVLCETFRKRLVLSV 

KPAAQGQLRTVSNAQTADEERTESKGT 

30 

SEQ. ID. NO. 5: Human long form/mouse species chimeric MCH1R 

MSVGAMKKGVGRAVGLGGGSGCQATEEDPLPNCGACAPGQGGRRWRLPQP 
AWVEGSSARLWEQATGTGWMDl^ASLLPTGPNASNTSDGPDNLTSAGSPPR 

TG SISYINIIMPSVFGTICIXGnGNSTVnAWKra 
35 IXEIXGMPFMIHQLMGNGVWHFGETMCTLITAMDANS QFTST YILTAMAIDR 
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YLATVHPISSTKFRKPSMATLVICIX^ 

RLPNPDTDLYWFI1.YQFFLAFALPFVWAAYVKE.QRMTSSVAPASQRSIRLR 
TKKVTRTAIAICLVITVCWAPYY^QLTQI^ISRPTLTFVYLYNAAISLGYANS 
CWFVYIVLCETFRKRLVI^VKPAAQGQIJRTVSNAQTADEERTESKGT 

5 

SEQ. ID. NO. 6: GFP 

MSKGEEIJTGVWILVEIJ)GDVNGHKFSVSGEGEGDATYGKLTLKFICTTGKL 
PWWFILVTTFSYGVQCFSRYPDHMKQHDFEKSAMPEGYVQERTIFPKDDGN 
YKTRAEVKI^GDTLVNRIELKGIDF^DGNILGHKLEYNYNSHNVYIMADKQ 
10 KNGKVNFXIRHNffiDGSVQLADHYQQNTPIGDGPVLLPDNHYLSTQSALSKD 
PNEKRDHMVLLEFVTAAGITHGMDELYK 

SEQ. ID. NO. 7: EGFP 

MVSKGEELJTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKHCTTGK 
1 5 LPWWPTLVTTT.TYGVQCFSRYPDHMKQEODFPKSAMPEGYVQERTIFEKDDG 
hT^TRAEVKFEGDTLVNRffiLKGIDFKEDGNILGHKI^YNYNSHNVYIMADK 
QKNGKVNHORHNIEDGSVQLADHYQQNTPIGDGPV^ 
DPNEKRDHMVLLEFVTAAGITLGMDELYK 

20 SEQ. ID. NO. 8: Emerald 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Be Leu Val Glu Leu Asp 
Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr 
Gly Lys Leu Thr Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 

25 Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr Be Phe 
Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr 
Leu Val Asn Arg He Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly 
His Lys Leu Glu Tyr Asn Tyr Asn Ser His Lys Val Tyr He Thr Ala Asp Lys Gin Lys 
Asn Gly Ee Lys Val Asn Phe Lys Thr Arg His Asn Ee Glu Asp Gly Ser Val Gin Leu 

30 Ala Asp His Tyr Gin Gin Asn Thr Pro Be Gly Asp Gly Pro Val Leu Leu Pro Asp Asn 
His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met 
Val Leu Leu Glu Phe Val Thr Ala Ala Gly Be Thr Leu Gly Met Asp Glu Leu Tyr Lys 
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SEQ. ID. NO. 9: T paz 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Be Leu Val Glu Leu Asp 
Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr 
Gly Lys Leu Thr Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 

5 Leu Val Thr Thr Phe Gly Tyr Gly Val Gin Cys Phe Ala Arg Tyr Pro Asp His Met Arg 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr lie Phe 
Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr 
Leu Val Asn Arg He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly 
His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys 

10 Asn Gly lie Lys Val Asn Phe Lys Be Arg His Asn He Glu Asp Gly Ser Val Gin Leu 
Ala Asp His Tyr Gin Gin Asn Thr Pro Be Gly Asp Gly Pro Val Leu Leu Pro Asp Asn 
His Tyr Leu Ser Tyr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met 
Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 

15 SEQ. H). NO. 10: W1B 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val Glu Leu Asp 
Gly Asp Val Asn Gly His Arg Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr 
Gly Lys Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
Leu Val Thr Thr Leu Thr Trp Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

20 Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr lie Phe 
Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr 
Leu Val Asn Arg Be Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn Be Leu Gly 
His Lys Leu Glu Tyr Asn Tyr Be Ser His Asn Val Tyr Be Thr Ala Asp Lys Gin Lys 
Asn Gly Be Lys Ala His Phe Lys Be Arg His Asn Be Glu Asp Gly Ser Val Gin Leu 

25 Ala Asp His Tyr Gin Gin Asn Thr Pro Be Gly Asp Gly Pro Val Leu Leu Pro Asp Asn 
His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met 
Val Leu Leu Glu Phe Val Thr Ala Ala Gly Be Thr Leu Gly Met Asp Glu Leu Tyr Lys 

SEQ. ID. NO. 11: Mouse MCHlR-linker-EGFP 

30 MDLQASLLSTGPNASNISDGQDNFTI^GPPPRTRSVSYBmiPSVFGTICLLGI 

VGNS T VIF A V VKKS KLHWCS NVPDBTINLS V VDLLFLLGMPFMIHQLMGNG V 
WHFGETMCTLrTAMDANSQFTSTmTAMAIDRYLATVHPISSTKFRKPSMAT 
LVICIXWALSHSrTPVWLYARLffFPGGAVGCGIPXPNPDTDLYWFrLYQFPLA 
F ALPF VV1T A A YVKILQRMTS S V AP AS QRSH^LJITKR WRTAL\ICLVFFVC WA 
35 p Y YVLQLTQLSISRPTLTFV YLYNAAISLGYANS CUSTPFV YTVLCETFRKRLVLS 
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VKPAAQGQLRTVSNAQTADEERTESKGTVDGTAGPGSIATMVSKGEELFTGV 
VPILVEIX)GDVNGHKFSVSGEGEGDATYGKLTLJa^CTTGiaJWWPTLVTI^ 
TYGVQCFSRYPDHMKQHDFEKSAMPEGYVQERTIFEKDDGNYKTRAEVKFE 
GDTLVNRffiLKGroFKEDGNILGHKlEYNYNSHNVYIMADKQKNGIKVNFKIR 

5 H^nEDGSVQLADHYQQNTPIGDGPVIJ^ 
LEFVTAAGITLGMDELYK 

SEQ. ID. NO. 12: Mouse MCH1R/EGFP direct fusion 

MDLQASIl^TGPNASMSDGQDNFTlAGPPPRTllSVSYlNnWSWGTICllXJI 

10 VGNSTVIFAWKKSKIOTCSNWDIFIINLSVTO^ 

WHFGEmCTIJ^AMDANSQFTSTmTAMAroRYLATVHPISSTKFRKPSMAT 
LVICLLW ALSHSITP VWLYARLIPFPGGAVGCGn^NPDTDLYWFIEYQFFLA 
FALPFVWAAYVKILQRMTSSVAPASQRSIRLRTKRVTRTAIAICLVFFVCWA 
PYYVLQLTQLSISRPTLTFVYLTOAAISLGYANSCLNWVYIVLCETFRKRLVLS 

15 vKPAAQGQmTVSNAQTADEERTESKGTMVSKGEELFTGVVPILVELDGDVN 
GHKFSVSGEGEGDATYGKLTLKHCTTGKLPVPWPTLVTTLTY GVQCFSRYPD 
HMKQHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEGDTLVNRIELKGI 

DFKEDGNILGHKIEYNYNSHNVYIMADKQKNGIKVNFK^ 
HYQQNTPIGDGPVLLPDNHYLSTQSAI^KDPNEKRDHMVIIEFVTAAGITLG 

20 MDELYK 

SEQ. ID. NO. 13: Human short form/mouse species chimeric MCHlR-linker- 
EGFP 

MDlEASlEPTGPNASNTSDGPDNLTSAGSPPRTGSISYlNIIMPSVFGTICLLGnG 

25 NSTVIFAWKKSKLHWCNNWDIFII^^ 

FGETMCTLIT AMD ANS QFTSTYILTAMAIDR YLATVHPIS STKFRKPSMATLVI 
CLLWALSHSITPVWLYARLIPFPGGAVGCGIRLPNPDTDL\^WFTLYQFFLAFA 
LPFWrTAAYVKILQRMTSSVAPASQRSIRLRTKRVTRTAIAICLVFFVCWAPY 
YVLQLTQLSISRPTLTFVYLYNAAISLGYANSCLNPFV\TVLCETFRKRLVLSV 

30 KPAAQGQLRTVSNAQTADEERTESKGTVDGTAGPGSIATMVSKGEELFTGVV 
P^LDGDVNGHKFSVSGEGEGDATYGKLTLmCTTGKLPVPWPTLVTTLT 
YGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEG 
DTLVNRIELKGroFKEDGNILGimEYNYNSHNVYIMADKQKNGIKVNFKI^ 

NIEDGSVQLADHYQQNTPIGDGPVIX^ 
35 EFVTAAGITLGMDELYK 



-16- 



PCTAJSOl/08071 

WO 01/68706 



SEQ. H>. NO. 14: Human long form/mouse species chimeric MCHlR-linker- 
EGFP 

MSVGAMKKGVGRAVGLGGGSGCQATEEDPLPNCGACAPGQGGRRWRLPQP 
AWVEGSSARLWEQATGTGWMDLEASLLPTGPNASNTSDGPDNLTSAGSPPR 
TGSIS\TimiPSWGTICIXGnGNSTVFAVVKKSKIJIWCNNW 
IXHiGMPFNDHQJJVIGNGVWHFGETMCTIJr 

YLATVHPISSTKFP^SMATLVICII.WALSnsriPVWLYAP^FPGGAVGCGI 
RIJ'NPDTDLYWFTLYQFTlAFALPFYVff 

T^VTRTAMICLVFFVCWAPYYVLQLTQLSISRPTLTFVYLYNAAISLGYANS 
CWFVYIVLCETFRKRLVLSVKPAAQGQLRTVSNAQTADEERTESKGTVDGT 
AGPGSIATMVSKGEEIPTGVWILVEII)GDVNGHKFSVSGEGEGDATYGICLT 
LT^CTTGKIJWWPTLVTTLTYGVQCFSRYPDHMKQHDFEKSAMPEGYVQE 

rTIFFKDDGNYKTRAEVKFEGDTLVNRIEL^^ 

NVYINlAJDKQKNGIKVNFKIRHlSriE^ 
LSTQSALSKDPNEKRDHMVJXEFVTAAGrTLGMDELYK 

SEQ. ID. NO. 15: Human long form MCH1R cDNA 

ATGTCAGTGGGAGCCATGAAGAAGGGAGTGGGGAGGGCAGTTGGGCTTG 

GAGGCGGCAGCGGCTGCCAGGCTACGGAGGAAGACCCCCTTCCCAACTGC 

GGGGCTTGCGCTCCGGGACAAGGTGGCAGGCGCTGGAGGCTGCCGCAGC 

CTGCGTGGGTGGAGGGGAGCTCAGCTCGGTTGTGGGAGCAGGCGACCGG 

CACTGGCTGGATGGACCTGGAAGCCTCGCTGCTGCCCACTGGTCCCAACG 

CCAGCAACACCTCTGATGGCCCCGATAACCTCACTTCGGCAGGATCACCT 

CCTCGCACGGGGAGCATCTCCTACATCAACATCATCATGCCTTCGGTGTTC 

GGCACCATCTGCCTCCTGGGCATCATCGGGAACTCCACGGTCATCTTCGCG 

GTCGTGAAGAAGTCCAAGCTGCACTGGTGCAACAACGTCCCCGACATCTT 

CATCATCAACCTCTCGGTAGTAGATCTCCTCTTTCTCCTGGGCATGCCCTT 

CATGATCCACCAGCTCATGGGCAATGGGGTGTGGCACTTTGGGGAGACCA 

TGTGCACCCTCATCACGGCCATGGATGCCAATAGTCAGTTCACCAGCACC 

TACATCCTGACCGCCATGGCCATTGACCGCTACCTGGCCACTGTCGACCCC 

ATCTCTTCCACGAAGTTCCGGAAGCCCTCTGTGGCCACCCTGGTGATCTGC 

CTCCTGTGGGCCCTCTCCTTCATCAGCATCACCCCTGTGTGGCTGTATGCC 

AGACTCATCCCCTTCCCAGGAGGTGCAGTGGGCTGCGGCATACGCCTGCC 

CAACCCAGACACTGACCTCTACTGGTTCACCCTGTACCAGTTTTTCCTGGC 
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CTTTGCCCTGCCTTTTGTGGTCATCACAGCCGCATACGTGAGGATCCTGCA 
GCGCATGACGTCCTCAGTGGCCCCCGCCTCCCAGCGCAGCATCCGGCTGC 
GGACAAAGAGGGTGACCCGCACAGCCATCGCCATCTGTCTGGTCTTCTTT 
GTGTGCTGGGCACCCTACTATGTGCTACAGCTGACCCAGTTGTCCATCAGC 
5 CGCCCGACCCTCACCTTTGTCTACTTATACAATGCGGCCATCAGCTTGGGC 
TATGCCAACAGCTGCCTCAACCCCTTTGTGTACATCGTGCTCTGTGAGACG 
TTCCGCAAACGCTTGGTCCTGTCGGTGAAGCCTGGAGCCCAGGGGCAGCT 
TCGCGCTGTCAGCAACGCTCAGACGGCTGACGAGGAGAGGACAGAAAGC 

AAAGGCACCTGA 

10 

SEQ. ID. NO. 16: Human short form MCH1R cDNA 

ATGGACCTGGAAGCCTCGCTGCTGCCCACTGGTCCCAATGCCAGCAACAC 
CTCTGATGGCCCCGATAACCTCACTTCGGCAGGATCACCTCCTCGCACGG 
GGAGCATCTCCTACATCAACATCATCATGCCTTCGGTGTTCGGCACCATCT 

15 GCCTCCTGGGCATCATCGGGAACTCCACGGTCATCTTCGCGGTCGTGAAG 
AAGTCCAAGCTGCACTGGTGCAACAACGTCCCCGACATCTTCATCATCAA 
CCTCTCGGTAGTAGATCTCCTCTTTCTCCTGGGCATGCCCTTCATGATCCA 
CCAGCTCATGGGCAATGGGGTGTGGCACTTTGGGGAGACCATGTGCACCC 
TCATCACGGCCATGGATGCCAATAGTCAGTTCACCAGCACCTACATCCTG 

oo ACCGCCATGGCCATTGAGCGCTACCTGGCCACTGTCCACCCCATCTCTTCC 
ACGAAGTTCCGGAAGCCCTCTGTGGCCACCCTGGTGATCTGCCTCCTGTGG 
GCCCTCTCCTTCATCAGCATCACCCCTGTGTGGCTGTATGCCAGACTCATC 
CCCTTCCCAGGAGGTGCAGTGGGCTGCGGCATACGCCTGCCCAACCCAGA 
CACTGACCrCTACTGGTTCACCCTGTACCAGTTTTTCCTGGCCTTTGCCCTG 

25 CCTTTTGTGGTCATCACAGCCGCATACGTGAGGATCCTGCAGCGCATGAC 
GTCCTCAGTGGCCCCCGCCTCCCAGCGCAGCATCCGGCTGCGGACAAAGA 
GGGTGACCCGCACAGCCATCGCCATCTGTCTGGTCTTCTTTGTGTGCTGGG 
CACCCTACTATGTGCTACAGCTGACCCAGTTGTCCATGAGCCGCCCGACCC 
TCACCTTTGTCTACTTATACAATGCGGCCATCAGCTTGGGCTATGCCAACA 

30 GCTGCCTCAACCCCTTTGTGTACATCGTGCTCTGTGAGACGTTCCGCAAAC 
GCTTGGTCCTGTCGGTGAAGCCTGCAGCCCAGGGGCAGCTTCGCGCTGTC 
AGCAACGCTCAGACGGCTGACGAGGAGAGGACAGAAAGCAAAGGCACCT 

GA 
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SEQ- ID. NO. 17: Mouse MCH1R cDNA 

Nucleic acid sequence start and stop codons are highlighted: 

GGCGGTAGAGGAAGACCCTTTTCTGGACTGCGGGGCTCAAGCTCCGGACA 
AGGCGGTGGAGGGCGCTGGAGGCTGCCGCAGCCTGCGTGGGTGGACGGG 

5 CGCTCCACTCCAGGGAGCAGGCGACCTGCACCGGCTGCATGGATCTGCAA 
GCCTCGTTGCTGTCCACTGGCCCCAATGCCAGCAACATCTCCGATGGCCA 
GGATAATTTCACATTGGCGGGGCCACCTCCTCGCACAAGGAGTGTCTCCT 
ACATCAACATCATCATGCCTTCAGTGTTTGGTACCATCTGTCTCCTGGGCA 
TTGTGGGAAACTCCACAGTCATTTTTGCCGTGGTGAAGAAATCCAAGCTG 

10 CACTGGTGCAGCAACGTCCCTGACATCTTCATCATCAACCTCTCTGTGGTG 
GATCTGCTTTTCCTGCTGGGCATGCCTTTCATGATCCACCAGCTCATGGGT 
AATGGTGTCTGGCACTTTGGGGAAACCATGTGCACCCTCATCACAGCCAT 
GGACGCCAACAGTCAGTTCACCAGCACCTACATCCTGACTGCTATGGCCA 
TTGACCGCTACTTGGCCACCGTCCATCCCATCTCCTCCACCAAGTTCCGGA 

15 AGCCCTCCATGGCCACCCTGGTGATCTGCCTCCTGTGGGCTCTCTCGTTCA 
TTAGCATCACTCCTGTGTGGCTCTATGCCAGGCTTATCCCCTTCCCAGGGG 
GTGCTGTGGGCrGTGGCATCCGCCTACCAAACCCAGATACTGATCTTTACT 
GGTTCACTCTGTATCAGTTTTTCCTGGCCTTCGCCCTTCCGTTTGTGGTCAT 

cactgctgcgtacgtgaaaatactacagcgcatgacgtcttcggtggccc 

20 CAGCCTCTCAACGCAGGATCCGGCTTCGGACAAAGAGGGTGACCCGCACA 
GCCATTGCCATCTGTCTGGTCTTCnTGTGTGCTGGGCGCCCTACTACGTG 

ctgcagctgacccagttgtccatcagccgcccgaccctcacattcgtctac 

CTGTACAATGCGGCCATCAGCTTGGGCTATGCCAACAGCTGCCTCAATCC 
CmGTGTACATAGTACTCTGTGAGACCTITCGAAAACGCTTGGTGCTGTC 

25 GGTGAAGCCCGCGGCCCAGGGGCAGCTTCGCACGGTCAGCAATGCTCAGA 
CAGCTGACGAGGAGAGGACAGAAAGCAAAGGCACCTGACAATCCCCCCC 
GGTCACCTCCAAGTCAGGTCACCGCATCAAACCATGGGGAGAGATACTGA 
GATAAACCCGGGGCTACCCTGGGAGGATGCAGAAGCTGGAGGCTGGGGG 
CTTGTAGCAAACCACATTCCACGGGGCCCACAAATTGCTAGGGAGGCTTG 

30 CAGCCTGGTTTGGGGGGGAAGCCTCAGACTGCAGGGATCCCCTTGACAGA 
ATAGAAGCGGAGCAAGAAGGAAAGGGTGGTTTGACTGGTTCTCGGGGTCT 
GTATCTGTTGGCTCGCATATATCTTTCTCTCAAGGGAAGAAGGCGGAGGT 

gcctagctgggttcctttaaaActaggcagggctaggatctgagcagcta 

GGGCTCTACTGTGAGACTGGGCAAGCCGAGCGTTCCCTCCCATCTCTCATT 
35 GGTGTTGATAGAAGGCAGTCTTTCTCCCAAGCTGGTGGATCTCCTGAAGC 
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ACGCTGCCTGGGCTCCAGCATCCTGTGCGGATTTCACGTTCTCTTTAGGGG 
ATGCATGTTGACACTGGGGTGTGGGCTCTGAGCCCACAGGAGTTTAAAAA 
ACCAAAAGAGCTGAGAGTGTCGAGAGAGACCCAATCACCGAGAATGACA 
AGGCAACCTGGGGTGGATGTGGATCTTGAAACTAATAAAAAGGGGTTTTC 
5 ACAGTGACAGCGACATTCTCTTCATAGGGCACAGCTGTCAGTCTATGGCT 
GATCCAGAGCGAGCATCCATGAATTCTGCATGTGCAGGGGTCACTCTAAT 
ACCrGATATGTTGGCATCATCTTTGTGCTTGAGCGTTCCNCTCCCAAATGG 
GAATGAAATAAAGGCAAATTCCCNCCCCCCCCAAAAAAGGGGNAAAAAA 

AAAAAAAAAAAAAAAAAAAA 

SEQ. ID. NO. 18: Mouse MCH1R genomic DNA 

Nucleic acid sequence start and stop codons, as well as intron borders, are 
highlighted: 

GGCGGTAGAGGAAGACCCTTTTCTGGACTGCGGGGCTCAAGCTCCGGACA 

15 AGGCGGTGGAGGGCGCTGGAGGCTGCCGCAGCCTGCGTGGGTGGACGGG 
CGCTCCACTCCAGGGAGCAGGCGACCTGCACCGGCTGCATGGATCTGCAA 
GCCTCGTTGCTGTCCACTGGCCCCAATGCCAGCAACATCTCCGATGGCCA 
GGATAATTTCACATTGGCGGGTGAGTCGAGTTGGAGTCCTCCCTCCTCCG 
GGATGGGTGTGGAAAATGGGAAGGTTTCACCTCCCAAGCCAAACTGCCTG 

20 GGAAACTITATC1TACAGTTCTTGGTGATAAGATCTGCAGTCGGCTTTGCC 
TGAAGAGGAAGAGGAGAGGAGGGGACACCAGCTAGGACAGAAGGGGCA 
GGGAGGAATAGAGATGGGGCAGAGGCACATTTAGAAACAACAAGGGTTG 
GTGACAAGACGTGAGGCAGGCTTGAGGGGAAAGCTTGCTGATGAGTCCCA 
AATATGCTTTGCAGGGGGGGGGGGGGGGGAATCAAGGCTGGAGAAGCAA 

25 GCAAGCAAGACAGCAAGACAGCGGGCGGGTAGTATGTGGGAGCCAGCAG 
AAGCGCrTTGATTCACCGCTATCCTGGGCTCAATCCTCTGGCCTCGCACTG 
GGGAAATGGGGTCTGAGTGGTCCTTGCTGTCTTCTGGCAAAGGCTGGTGG 
GAGCAAAAGACTTCACAGGGCGTGAGAGGATTAACTTTTCTGGTGAATTA 
AGCTTCTTGACATTTGCAGAACGTCAATGCCTTAAAATTCTAGCTCTGAAG 

30 GAGAAGGGAATGAAGGGGAAAGAGGGAAGGTTGGTGTGGAGAAATTCCC 
AAGCTTCTGGGGTGTAACACAGCTCCAGTCCCTACCCTATTGGGAAAGCC 
CAGACTCAGGAGACATGGTCCAAGGAAATCCCTGACAGAAAACCGGGAG 
AGGGCAGGGCTGTGGAGCCTGAAACACACCCCACACCCATGGTGACAGTC 
ACTTCTCACATATGCCTAGGAACCTATCTGAAACCTTTGGCCATCTCTCTC 

35 TGAAAAGATGAGGCTGCAAATACACACACACACACACACACACACACAC 
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ACACACACACACACACACACACACACACACACACAAATGTCCTTCAAGCC 
TTTTTGACAAGGTTTTCTGGTGGATCCCGGGGATATGAAGTTGTTCTCAGC 
AGATATCTGGGAGTCTTGACTCCTGGCCCTCTGAGTAAATGGATGAAGCG 
AAGAAGAATGGGGTCCTCTGAGTAACAGGTGGATCTAGAAAATCCTATAG 
5 GAGTCACCAGGGCACGGTGGAGGAGGGTAAGGTACAGAACTAACAATAG 
CCCGAGAAGGGGAAACAGCAGGAGATGATTCCAGAGACGTAGTGACCCC 
AAGCTGCAAGGGAAAGCATGAGGGGCCAGCAGGAAGGCCGACATGGCAG 
GTTGTCAGCTTCTAGATCGGAAGGCGGGTCACACTTGCTCTTTCTATCCTC 
AGGGCCACCTCCTCGCACAAGGAGTGTCTCCTACATCAACATCATCATGC 
10 CTTCAGTGTTTGGTACCATCTGTCTCCTGGGCATTGTGGGAAACTCCACAG 
TCATTTTTGCCGTGGTGAAGAAATCCAAGCTGCACTGGTGCAGCAACGTC 
CCTGACATOTCATCATCAACCTCTCTGTGGTGGATCTGCTTTTCCTGCTGG 
GCATGCCTTTCATGATCCACCAGCTCATGGGTAATGGTGTCTGGCACTTTG 
GGGAAACCATGTGCACCCTCATCACAGCCATGGACGCCAACAGTCAGTTC 
15 ACCAGCACCTACATCCTGACTGCTATGGCCATTGACCGCTACTTGGCCACC 
GTCCATCCCATCTCCTCCACCAAGTTCCGGAAGCCCTCCATGGCCACCCTG 
GTGATCTGCCTCCTGTGGGCTCTCTCGTTCATTAGCATCACTCCTGTGTGG 
CTCTATGCCAGGCTTATCCCCTTCCCAGGGGGTGCTGTGGGCTGTGGCATC 
CGCCTACCAAACCCAGATACTGATCnTACTGGTrCACTCTGTATCAGTTT 
20 TTCCTGGCCTTCGCCCTTCCGTTTGTGGTCATCACTGCTGCGTACGTGAAA 
ATACTACAGCGCATGACGTCTTCGGTGGCCCCAGCCTCTCAACGCAGCAT 
CCGGCTTCGGACAAAGAGGGTGACCCGCACAGCCATTGCCATCTGTCTGG 
TCTTCTTTGTGTGCTGGGCGCCCTACTACGTGCTGCAGCTGACCCAGTTGT 
CCATCAGCCGCCCGACCCTCACATTGGTCTACCTGTACAATGCGGCCATCA 
25 GCTTGGGCTATGCCAACAGCTGCCTCAATCCCTTTGTGTACATAGTACTCT 
GTGAGACCTTTCGAAAACGCTTGGTGCTGTCGGTGAAGCCCGCGGCCCAG 
GGGCAGCTTCGCACGGTCAGCAATGCTCAGACAGCTGACGAGGAGAGGA 
CAGAAAGCAAAGGCACCTGACAATCCCCCCCGGTCACCTCCAAGTCAGGT 
CACCGCATCAAACCATGGGGAGAGATACTGAGATAAACCCGGGGCTACC 
30 CTGGGAGGATGCAGAAGCTGGAGGCTGGGGGCTTGTAGCAAACCACATTC 
CACGGGGCCCACAAATTGCTAGGGAGGCTTGCAGCCTGGTTTGGGGGGGA 
AGCCTCAGACTGCAGGGATCCCCTTGACAGAATAGAAGCGGAGCAAGAA 
GGAAAGGGTGGTTTGACTGGTTCTCGGGGTCTGTATGTGTTGGCTCGCATA 
TATCTTTCTCTCAAGGGAAGAAGGCGGAGGTGCCTAGCTGGGTTCCTTTA 
35 AAACTAGGCAGGGCTAGGATCTGAGCAGCTAGGGCTCTACTGTGAGACTG 
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GGCAAGCCGAGCGTTCCCTCCCATCTCTCATTGGTGTTGATAGAAGGCAG 
TCTTTCTCCCAAGCTGGTGGATCTCCTGAAGCACGCTGCCTGGGCTCCAGC 
ATCCTGTGCGGATTTCACGTTCTCTTTAGGGGATGCATGTTGACACTGGGG 
TGTGGGCTCTGAGCCCACAGGAGTTTAAAAAAGCAAAAGAGCTCAGAGTG 

5 TCGAGAGAGACCCAATCACCGAGAATGACAAGGCAACCTGGGGTGGATG 
TGGATCTTGAAACTAATAAAAAGGGGTTTTCACAGTGACAGCGACATTCT 
CTTCATAGGGCACAGCTGTCAGTCTATGGGTGATCCAGAGCGAGCATCCA 
TGAATTCTGCATGTGCAGGGGTCACTCTAATACCTGATATGTTGGCATCAT 
CTTTGTGCTTGAGCCTTCCNCTCCCAAATGGGAATGAAATAAAGGCAAAT 

10 TCCCNCCCCCCCCAAAAAAGGGGNAAAAAAAAAAAAAAAAAAAAAAAA 

AA 

SEQ. ED. NO. 19: Human short form/mouse species chimeric MCHIR 

ATGGACCTGGAAGCCTCGCTGCTGCCCACTGGTCCCAATGCCAGCAACAC 

15 CTCTGATGGCCCCGATAACCTCACTTCGGCAGGATCACCTCCTCGCACGG 
GGAGCATCTCCTACATCAACATCATCATGCCTTCGGTGTTCGGCACCATCT 
GCCTCCTGGGCATCATCGGGAACTCCACGGTCATCTTCGCGGTCGTGAAG 
AAGTCCAAGCTGCACTGGTGCAACAACGTCCCCGACATCTTCATCATCAA 
CCTCTCGGTAGTAGATCTCCTCTTTCTCCTGGGCATGCCCTTCATGATCCA 

20 CCAGCTCATGGGCAATGGGGTGTGGCACTTTGGGGAGACCATGTGCACCC 
TCATCACGGCCATGGATGCCAATAGTCAGTTCACCAGCACGTACATCCTG 
ACCGCCATGGCCATTGACCGCTACCTGGCCACTGTCCACCCCATCTCTTCC 
ACGAAGTTCCGGAAGCCCTCCATGGCCACCCTGGTGATCTGCCTCCTGTG 
GGCTCTCTCGTTCATTAGCATCACTCCTGTGTGGCTCTATGCCAGGCTTAT 

25 CCCCTTCCCAGGGGGTGCTGTGGGCTGTGGCATCCGCCTACCAAACCCAG 
ATACTGATCTTTACTGGTTCACTCTGTATCAGTTTTTCCrGGCCTTCGCCCT 
TCCGTTTGTGGTCATCACTGCTGCGTACGTGAAAATACTACAGCGCATGAC 
GTCTTCGGTGGCCCCAGCCTCTGAACGCAGCATCCGGCTTCGGACAAAGA 
GGGTGACCCGCACAGCCATTGCCATCTGTCTGGTCTrCTTTGTGTGCTGGG 

30 CGCCCTACTACGTGCTGCAGCTGACCCAGTTGTCCATCAGCCGCCCGACC 
CTCACATTCGTCTACCTGTACAATGCGGCCATCAGCTTGGGCTATGCCAAC 
AGCTGCCTCAATCCCTTTGTGTACATAGTACrCTGTGAGACCTTTCGAAAA 
CGCTTGGTGCTGTCGGTGAAGCCCGCGGCCCAGGGGCAGCTTCGCACGGT 
CAGCAATGCTCAGACAGCTGACGAGGAGAGGACAGAAAGCAAAGGCACC 

35 TGA 
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SEQ. ID. NO. 20: Human long form/mouse species chimeric MCH1R 

ATGTCAGTGGGAGCCATGAAGAAGGGAGTGGGGAGGGCAGTTGGGCTTG 
GAGGCGGCAGCGGCTGCCAGGCTACGGAGGAAGACCCCCTTCCCAACTGC 

5 GGGGCTTGCGCTCCGGGACAAGGTGGCAGGCGCTGGAGGCTGCCGCAGC 
CTGCGTGGGTGGAGGGGAGCTCAGCTCGGTTGTGGGAGCAGGCGACCGG 
CACTGGCTGGATGGACCTGGAAGCCTCGCTGCTGCCCACTGGTCCCAACG 
CCAGCAACACCTCTGATGGCCCCGATAACCTCACTTCGGCAGGATCACCT 
CCTCGCACGGGGAGCATCTCCTACATCAACATCATCATGCCTTCGGTGTTC 

0 GGCACCATCTGCCTCCTGGGCATCATCGGGAACTCCACGGTCATCTTCGCG 
GTCGTGAAGAAGTCCAAGCTGCACTGGTGCAACAACGTCCCCGACATCTT 
CATCATCAACCTCTCGGTAGTAGATCTCCTCTTTCTCCTGGGCATGCCCTT 
CATGATCCACCAGCTCATGGGCAATGGGGTGTGGCACTTTGGGGAGACCA 
TGTGCACCCTCATCACGGCCATGGATGCCAATAGTCAGTTCACCAGCACC 

L5 TACATCCTGACCGCCATGGCCATTGACCGCTACCTGGCCACTGTCCACCCC 
ATCTCTTCCACGAAGTTCCGGAAGCCCTCCATGGCCACCCTGGTGATCTGC 
CTCCTGTGGGCTCTCTCGTTCATTAGCATCACTCCTGTGTGGCTCTATGCC 
AGGCTTATCCCCTTCCCAGGGGGTGCTGTGGGCTGTGGCATCCGCCTACCA 

aacccagatactgatctttactggttcactctgtatcagtttttcctggcct 
20 tcgcccttccgtttgtggtcatcactgctgcgtacgtgaaaatactacagc 

GCATGACGTCTTCGGTGGCCCCAGCCTCTCAACGCAGCATCCGGCTTCGG 
ACAAAGAGGGTGACCCGCACAGCCATTGCCATCTGTCTGGTCTTCTTTGTG 
TGCTGGGCGCCCTACTACGTGCTGCAGCTGACCCAGTTGTCCATCAGCCGC 
CCGACCCTCACATTCGTCTACCTGTAGAATGCGGCCATCAGCTTGGGCTAT 
25 GCCAACAGCTGCCTCAATCCCTTTGTGTACATAGTACTCTGTGAGACCTTT 
CGAAAACGCTTGGTGCTGTCGGTGAAGCCCGCGGCCCAGGGGCAGCTTCG 
CACGGTCAGCAATGCTCAGACAGCTGACGAGGAGAGGACAGAAAGCAAA 

GGCACCTGA 

30 SEQ. ID. NO. 21: Aequorea victoria Green Fluorescent Protein (GFP) cDNA 
Nucleic acid sequence start and stop codons are highlighted: 

TACACACGAATAAAAGATAACAAAGATGAGTAAAGGAGAAGAACmTC 
ACTGGAGTTGTCCCAATTCrTGTTGAATTAGATGGTGATGTTAATGGGCAC 
AAATTTTCTGTCAGTGGAGAGGGTGAAGGTGATGCAACATACGGAAAACT 
35 TACCCTTAAATTTATTTGCACTACTGGAAAACTACCTGTTCCATGGCCAAC 
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ACTTGTCACTACTTTCTCTrATGGTGTTCAATGCTTTTCAAGATACCCAGAT 
CATATGAAACAGCATGACTTTTTCAAGAGTGCCATGCCCGAAGGTTATGT 
ACAGGAAAGAACTATATTTTTCAAAGATGACGGGAACTACAAGACAGGTG 
CTGAAGTCAAGTTTGAAGGTGATACCCTTGTTAATAGAATCGAGTTAAAA 

5 GGTATTGATTTTAAAGAAGATGGAAACATTCTTGGACACAAATTGGAATA 
CAACTATAACTCACACAATGTATACATCATGGCAGACAAACAAAAGAATG 
GAATCAAAGTTAACrrCAAAATTAGACACAACATrGAAGATGGAAGCGTT 
CAACTAGCAGACCATTATCAACAAAATACTCCAATTGGCGATGGCCCTGT 
CCTTTTACCAGACAACCATTACCTGTCCACACAATCTGCCCTTTCGAAAGA 

10 TCCCAACGAAAAGAGAGACCACATGGTCCITCTTGAGTTTGTAACAGCTG 
CTGGGATTACACATGGCATGGATGAACTATACAAATAAATGTCCAGACTT 
CCAATTGACACTAAAGTGTCCGAACAATTACTAAAATCTCAGGGTTCCTG 
GTTAAATTCAGGCTGAGATATTATTTATATATTTATAGATTCATTAAAATT 
GTATGAATAATTTATTGATGTTATTGATAGAGGTTATTTTCTTATTAAACA 

15 GGCTACTTGGAGTGTATTCITAATTCTATATTAATrACAATTTGATTTGACT 

TGCTCAAA 

SEQ. ID. NO. 22: EGFP + Linker 

Nucleic acid sequence start and stop codons are highlighted and a 12 amino acid 

20 linker sequence is denoted in lower case: 

gtcgacggtaccgcgggcccgggatccatcgccaccATGGTGAGCAAGGGCGAGGAGCTGTT 

CACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCC 

ACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAA 

GCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGC 

25 CCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTAC 
CCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGG 
CTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGA 
CCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAG 
CTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGC 

30 TGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAG 
AAGAAGGGCATCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACG 
GCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGAC 
GGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCAGTCCGCCCT 
GAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCG 
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TGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAGTAAAGC 
GGCCGC 

SEQ. ID. NO. 23: Emerald 
5 ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT 

CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG 

GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC 

CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCTTGACCT 

ACGGCGTGCAGTGCTTCGCCCGCTACCCCGACCACATGAAGCAGCACGAC 

10 TTCITCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC 

ttcaaggacgacggcaactacaaGacccgcgccgaggtgaagttcgagg 

GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG 
GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACA 
AGGTCTATATCACCGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTC 
15 AAGACCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACT 
ACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAAC 
CACTACCTGAGCACCCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCG 
CGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCG 

GCATGGACGAGCTGTACAAGTAA 

20 

SEQ. ID. NO. 24: Topaz 

ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT 

CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG 

GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC 

25 CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCTTCGGCT 
ACGGCGTGCAGTGCTTCGCCCGCTACCCCGACCACATGCGCCAGCACGAC 
TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC 
TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG 
GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG 

30 GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACA 
ACGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTC 
AAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTA 
CCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACC 
ACTACCTGAGCTACCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGC 
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GATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGG 
CATGGACGAGCTGTACAAGTAA 

SEQ.D).N0.25:W1B 

5 ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT 

CGAGCTGGACGGCGACGTAAACGGCCACAGGTTCAGCGTGTCCGGCGAG 
GGCGAGGGCGATGCCACCTACGGCAAGCTGAGCCTGAAGTTCATCTGCAC 
CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT 
GGGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC 

10 TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGTACCATCTTC 
TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG 
GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG 
GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACATCAGCCACA 
ACGTCTATATCACCGCCGACAAGCAGAAGAACGGCATCAAGGCCCACTTC 

15 AAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTA 
CCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACC 
ACTACCTGAGCACCCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGC 
GATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGG 

CATGGACGAGCTGTACAAGTAA 

20 

SEQ. ID. NO. 26: Mouse MCHlR-linker-EGFP 

Nucleic acid sequence start codon and start and stop codons for mouse MCH1R and 
EGFP, respectively, as well as intron borders, are highlighted and a 12 amino acid 

linker sequence is denoted in lower case: 
95 ATGGATCTGCAAGCCTCGTTGCTGTCCACTGGCCCCAATGCCAGCAACAT 

CTCCGATGGCCAGGATAATTTCACATTGGCGGGTGAGTCGAGTTGGAGTC 

CTCCCTCCTCCGGGATGGGTGTGGAAAATGGGAAGGTTTCACCTCCCAAG 

CCAAACTGCCTGGGAAACTTTATCTTACAGTTCTTGGTGATAAGATCTGCA 

GTCGGCTTTGCCTGAAGAGGAAGAGGAGAGGAGGGGACACCAGCTAGGA 

30 CAGAAGGGGCAGGGAGGAATAGAGATGGGGCAGAGGCACATTTAGAAAC 
AACAAGGGTTGGTGACAAGACGTGAGGCAGGCTTGAGGGGAAAGCTTGC 
TGATGAGTCCCAAATATGCTTTGCAGGGGGGGGGGGGGGGGAATCAAGG 
CTGGAGAAGCAAGCAAGCAAGACAGCAAGACAGCGGGCGGGTAGTATGT 
GGGAGCCAGCAGAAGCGCTTTGATTCACCGCTATCCTGGGCTCAATCCTC 

35 TGGCCTCGCACTGGGGAAATGGGGTCTGAGTGGTCCTTGCTGTCTTCTGGC 
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AAAGGCTGCTGGGAGCAAAAGACTTCACAGGGCGTGAGAGGATTAACTTT 

TCTGGTGAATTAAGCTTCTTGACATTTGCAGAACGTCAATGCCTTAAAATT 

CTAGCTCTGAAGGAGAAGGGAATGAAGGGGAAAGAGGGAAGGTTGGTGT 

GGAGAAATTCCCAAGCTTCTGGGGTGTAACACAGCTCCAGTCCCTACCCT 

ATTGGGAAAGCCCAGACTCAGGAGACATGGTCCAAGGAAATCCCTGACA 

GAAAACCGGGAGAGGGCAGGGCTGTGGAGCCTGAAACACACCCCACACC 

CATGGTGACAGTCACTTCTCACATATGCCTAGGAACCTATCTGAAACCnT 

GGCCATCTCTCTCTGAAAAGATGAGGCTGCAAATACACACACACACACAC 

ACACACACACACACACACACACACACACACACACACACACACACACAAA 

TGTCCTTCAAGCCTTTTTGACAAGGTTTTCTGGTGGATCCCGGGGATATGA 

AGTTGTTCTCAGCAGATATCTGGGAGTCTTGACTCCTGGCCCTCTGAGTAA 

ATGGATGAAGCGAAGAAGAATGGGGTCCTCTGAGTAACAGGTGGATCTA 

GAAAATCCTATAGGAGTCACCAGGGCACGGTGGAGGAGGGTAAGGTACA 

GAACTAACAATAGCCCGAGAAGGGGAAACAGCAGGAGATGATTCCAGAG 

ACGTAGTGACCCCAAGCTGCAAGGGAAAGCATGAGGGGCCAGCAGGAAG 

GCCGACATGGCAGGTTGTCAGCTTCTAGATCGGAAGGCGGGTCACACTTG 

CTCnTCTATCCTCAGGGCCACCTCCTCGCACAAGGAGTGTCTGCTACATC 

AACATCATCATGCCTTCAGTGTTTGGTACCATCTGTCTCCTGGGCATTGTG 

GGAAACTCCACAGTCATTTTTGCCGTGGTGAAGAAATCCAAGCTGCACTG 

GTGCAGCAACGTCCCTGACATCTTCATCATCAACCTCTCTGTGGTGGATCT 

GCTTTTCCTGCTGGGCATGCCTTTCATGATCCACCAGCTCATGGGTAATGG 

TGTCTGGCACTTTGGGGAAACCATGTGCACCCTCATCACAGCCATGGACG 

CCAACAGTCAGTTCACCAGCACCTACATCCTGACTGCTATGGCCATTGACC 

GCTACTTGGCCACCGTCCATCCCATCTCCTCCACCAAGTTCCGGAAGCCCT 

CCATGGCCACCCTGGTGATCTGCCTCCTGTGGGCTCTCTCGTTCATTAGCA 

TCACTCCTGTGTGGCTCTATGCCAGGCTTATCCCCTTCCCAGGGGGTGCTG 

TGGGCTGTGGCATCCGCCTACCAAACCCAGATACTGATCTTTACTGGTTCA 

CTCTGTATCAGTTTTTCCTGGCCTTCGCCCTTCCGTTTGTGGTCATCACTGC 

TGCGTACGTGAAAATACTACAGCGCATGACGTCTTCGGTGGCCCCAGCCT 

CTCAACGCAGCATCCGGCTTCGGACAAAGAGGGTGACCCGCACAGCCATT 

GCCATCTGTCTGGTCTTCTTTGTGTGCTGGGCGCCCTACTACGTGCTGCAG 

CTGACCCAGTTGTCCATCAGCCGCCCGACCCTCACATTCGTCTACCTGTAC 

AATGCGGCCATCAGCTTGGGCTATGCCAACAGCTGCCTCAATCCCTTTGTG 

TACATAGTACTCTGTGAGACCTTTCGAAAACGCTTGGTGCTGTCGGTGAA 

GCCCGCGGCCCAGGGGCAGCTTCGCACGGTCAGCAATGCTCAGACAGCTG 
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ACGAGGAGAGGACAGAAAGCAAAGGCACCgtcgacggtaccgcgggcccgggatccatcg 
ccaccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCC 
TGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCGGGC 
GAGGGCGAGGGGGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTG 

5 cACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGA 
CCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCAC 
GACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCAT 
CTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCG 
AGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAG 

10 GAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCC 
ACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAA 
CTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACC 
ACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGAC 
AACCACTACCTGAGCACCCAGTCCGCCCTGAGCAAAGACCCCAACGAGAA 

15 GCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTC 
TCGGCATGGACGAGCTGTACAAGTAA 

SEQ. ID. NO. 27: Mouse MCH1R/EGFP direct fusion 

Nucleic acid sequence start codon and start and stop codons for mouse MGH1R and 

20 EGFP, respectively, as well as intron borders, are highlighted: 

ATGGATCTGCAAGCCTCGTTGCTGTCCACTGGCCCCAATGCCAGCAACAT 

CTCCGATGGCCAGGATAATTTCACATTGGCGGGTGAGTCGAGTTGGAGTC 

CTCCCTCCTCCGGGATGGGTGTGGAAAATGGGAAGGTTTCACCTCCCAAG 

CCAAACTGCCTGGGAAACTTTATCTTACAGTTCTTGGTGATAAGATCTGCA 

25 GTCGGCTTTGCCTGAAGAGGAAGAGGAGAGGAGGGGACACCAGCTAGGA 
CAGAAGGGGCAGGGAGGAATAGAGATGGGGCAGAGGCACATTTAGAAAC 
AACAAGGGTTGGTGACAAGACGTGAGGCAGGCTTGAGGGGAAAGCTTGC 
TGATGAGTCCGAAATATGCTTTGCAGGGGGGGGGGGGGGGGAATCAAGG 
CTGGAGAAGCAAGCAAGCAAGACAGCAAGACAGCGGGCGGGTAGTATGT 

30 GGGAGCCAGCAGAAGCGCTTTGATTCACCGCTATCCTGGGCTCAATCCTC 
TGGCCTCGCACTGGGGAAATGGGGTCTGAGTGGTCCTTGCTGTCTTCTGGC 
AAAGGCTGCTGGGAGCAAAAGACTTCACAGGGCGTGAGAGGATTAACTTT 
TCrGGTGAATTAAGCTTCITGACATTTGCAGAACGTCAATGCCTTAAAATT 
CTAGCTCTGAAGGAGAAGGGAATGAAGGGGAAAGAGGGAAGGTTGGTGT 

35 GGAGAAATTCCCAAGCTTCTGGGGTGTAACACAGCTCCAGTCCCTACCCT 
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ATTGGGAAAGCCCAGACTCAGGAGACATGGTCCAAGGAAATCCCTGACA 
GAAAACCGGGAGAGGGCAGGGCTGTGGAGCCTGAAACACACCCCACACC 
CATGGTGACAGTCACTTCTCACATATGCCTAGGAACCTATCTGAAACCTTT 
GGCCATCTCTCTCTGAAAAGATGAGGCTGCAAATACACACACACACACAC 
5 ACACACACACACACACACACACACACACACACACACACACACACACAAA 
TGTCCTTCAAGCCTTTTTGACAAGGTTTTCTGGTGGATCCCGGGGATATGA 
AGTTGTTCTCAGCAGATATCTGGGAGTCTrGACTGCTGGCCCTCTGAGTAA 
ATGGATGAAGCGAAGAAGAATGGGGTCCTCTGAGTAACAGGTGGATCTA 
GAAAATCCTATAGGAGTCACCAGGGCACGGTGGAGGAGGGTAAGGTACA 
10 GAACTAACAATAGCCCGAGAAGGGGAAACAGCAGGAGATGATTCCAGAG 
ACGTAGTGACCCCAAGCTGCAAGGGAAAGCATGAGGGGCCAGCAGGAAG 
GCCGACATGGCAGGTTGTCAGCTTCTAGATCGGAAGGCGGGTCACACTTG 
CTCnTCTATCCTCAGGGCCACCTCCTCGCACAAGGAGTGTCTCCTACATC 
AACATCATCATGCCTrCAGTGTTTGGTACCATCTGTCTCCTGGGCATTGTG 
15 GGAAACTCCACAGTCATTTTTGCCGTGGTGAAGAAATCCAAGCTGCACTG 
GTGCAGCAACGTCCCTGACATCTTCATCATCAACCTCTCTGTGGTGGATCT 
GCTTTTCCTGCTGGGCATGCCTTTCATGATCCACCAGCTCATGGGTAATGG 
TGTCTGGCACTTTGGGGAAACCATGTGCACCCTCATCACAGCCATGGACG 
CCAACAGTCAGTTCACCAGCACCTACATCCTGACTGCTATGGCCATTGACC 
20 GCTACTTGGCCACCGTCCATCCCATCTCCTCCACCAAGTTCCGGAAGCCCT 
CCATGGCCACCCTGGTGATCTGCCTCCTGTGGGCTCTCTCGTTCATTAGCA 
TCACTCCTGTGTGGCTCTATGCCAGGCTTATCCCCTTCCCAGGGGGTGCTG 
TGGGCTGTGGCATCCGCCTACCAAACCCAGATACTGATCTTTACTGGTTCA 
CTCTGTATCAGTTTTTCCTGGCCTTCGCCCTTCCGTTTGTGGTCATCACTGC 
25 TGCGTACGTGAAAATACTACAGCGCATGACGTCTTCGGTGGCCCCAGCCT 
CTCAACGCAGCATCCGGCTTCGGACAAAGAGGGTGACCCGCACAGCCATT 
GCCATCTGTCTGGTCITCTTTGTGTGCTGGGCGCCCTACTACGTGCTGCAG 
CTGACCCAGTTGTCCATCAGCCGCCCGACCCTCACATTCGTCTACCTGTAC 
AATGCGGCCATCAGCTTGGGCTATGCCAACAGCTGCCTCAATCCCTTTGTG 
30 TACATAGTACTCTGTGAGACCTTTCGAAAACGCTTGGTGCTGTCGGTGAA 
GCCCGCGGCCCAGGGGCAGCTTCGCACGGTCAGCAATGCTCAGACAGCTG 
ACGAGGAGAGGACAGAAAGCAAAGGCACCATGGTGAGCAAGGGCGAGG 
AGCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTA 
AACGGCCACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCT 
35 ACGGCAAGCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTG 
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CCCTGGCCCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTCAG 
CCGCTACCCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGC 
CCGAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAAC 
TACAAGACCCGCGCCGAGGTGAAGTTCGAGGGGGACACCCTGGTGAACC 

5 GCATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGG 
GCACAAGCTGGAGTACAACTACAACAGCCAGAACGTCTATATCATGGCCG 
ACAAGCAGAAGAACGGCATCAAGGTGAACTTCAAGATCCGCCACAACAT 
CGAGGACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCC 
ATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCA 

in GTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGC 
TGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTAC 

AAGTAA 

SEQ. H). NO. 28: Human short form/mouse species chimeric MCHlR-linker- 

Nucleic acid sequence start codon and start and stop codons for mouse MCH1R and 
EGFP, respectively, are highlighted and a 12 amino acid linker sequence is denoted in 
lower case: 

ATGGACCTGGAAGCCTCGCTGCTGCCCACTGGTCCCAATGCCAGCAACAC 
20 CTCTGATGGCCCCGATAACCTCACTTCGGCAGGATCACCTCCTCGCACGG 
GGAGCATCTCCTACATCAACATCATCATGCCTTCGGTGTTCGGCACCATCT 
GCCTCCTGGGCATCATCGGGAACTCCACGGTCATCTTCGCGGTCGTGAAG 
AAGTCCAAGCTGCACTGGTGCAACAACGTCCCCGACATCTTCATCATCAA 
CCTCTGGGTAGTAGATCTCCTCnTCTCCTGGGCATGCCCTTCATGATCCA 
25 CCAGCTCATGGGCAATGGGGTGTGGCACTTTGGGGAGACCATGTGCACCC 
TCATCACGGCCATGGATGCCAATAGTCAGTTCACCAGCACCTACATCCTG 
ACCGCCATGGCCATTGACCGCTACCTGGCCACTGTCCACCCCATCTCTTCC 
ACGAAGTTCCGGAAGCCCTCCATGGCCACCCTGGTGATCTGCCTCCTGTG 
GGCTCTCTCGTTCATTAGCATCACTCCTGTGTGGCTCTATGCCAGGCTTAT 

an CCCCTTCCCAGGGGGTGCTGTGGGCTGTGGCATCCGCCTACCAAACCCAG 
ATACrGATCTTTACTGGTTCACTCTGTATCAGTTTTTCCTGGCCTTCGCCGT 
TCCGTTTGTGGTCATCACTGCTGCGTACGTGAAAATACTACAGCGCATGAC 
GTCTTCGGTGGCCCCAGCCTCTCAACGCAGCATCCGGGTTCGGACAAAGA 
GGGTGACCCGCACAGCCATTGCCATCTGTCTGGTCITCTTTGTGTGCTGGG 

35 cGCCCTACTACGTGCTGCAGCTGACCCAGTTGTCCATCAGCCGCCCGACC 
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CTCACATTCGTCTACCTGTACAATGCGGCCATCAGCTTGGGCTATGCCAAC 
AGCTGCCTCAATCCCTTTGTGTACATAGTACTCTGTGAGACCTTTCGAAAA 
CGCTTGGTGCTGTCGGTGAAGCCCGCGGCCCAGGGGCAGCTTCGCACGGT 
CAGCAATGCTCAGACAGCTGACGAGGAGAGGACAGAAAGCAAAGGCACC 

5 gtcgacggtaccgcgggcccgggatccatcgccaccATGGTGAGCAAGGGCGAGGAGCTGTT 
CACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCC 
ACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAA 
GCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGC 
CCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTAC 

10 CCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGG 
CTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGA 
CCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAG 
CTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGC 
TGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAG 

15 AAGAACGGCATCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACG 
GCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGAC 
GGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCAGTCCGCCCT 
GAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCG 
TGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAGTAA 

20 

SEQ. ID. NO. 29: Human long form/mouse species chimeric MCHlR-linker- 
EGFP 

Nucleic acid sequence start codon and start and stop codons for mouse MCH1R and 
EGFP, respectively, are highlighted and a 12 amino acid linker sequence is denoted in 
25 lowercase: 

ATGTCAGTGGGAGCCATGAAGAAGGGAGTGGGGAGGGCAGTTGGGCTTG 
GAGGCGGCAGCGGCTGCCAGGCTACGGAGGAAGACCCCCTTCCCAACTGC 
GGGGCTTGCGCTCCGGGACAAGGTGGCAGGCGCTGGAGGCTGCCGCAGC 
CTGCGTGGGTGGAGGGGAGCTCAGCTCGGTTGTGGGAGCAGGCGACCGG 

30 CACTGGCTGGATGGACCTGGAAGCCTCGCTGCTGCCCACTGGTCCCAACG 
CCAGCAACACCTCTGATGGCCCCGATAACCTCACTTCGGCAGGATGACCT 
CCTCGCACGGGGAGCATCTCCTACATCAACATCATCATGCCTTCGGTGTTC 
GGCACCATCTGCCTCCTGGGCATCATCGGGAACTCCACGGTCATCTTCGCG 
GTCGTGAAGAAGTCCAAGCTGCACTGGTGCAACAACGTCCCCGACATCTT 

35 CATCATCAACCTCTCGGTAGTAGATCTCCTCTTTCTCCTGGGCATGCCCTT 
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CATGATCCACCAGCTCATGGGCAATGGGGTGTGGCACTTTGGGGAGACCA 
TGTGCACCCTCATCACGGCCATGGATGCCAATAGTCAGTTCACCAGCACC 
TACATCCTGACCGCCATGGCCATTGACCGCTACCTGGCCACTGTCCACCCC 
ATCTCTTCCACGAAGTTCCGGAAGCCCTCCATGGCCACCCTGGTGATCTGC 

5 CTCCTGTGGGCTCTCTCGTTCATTAGCATCACTCCTGTGTGGCTCTATGCC 
AGGCTTATCCCCTTCCCAGGGGGTGCTGTGGGCTGTGGCATCCGCCTACCA 
AACCCAGATACTGATCTITACTGGTTCACTCTGTATCAGTTTTTCCTGGCCT 
TCGCCCTTCCGTTTGTGGTCATCACTGCTGCGTACGTGAAAATACTACAGC 
GCATGACGTCTTCGGTGGCCCCAGCCTCTCAACGCAGCATCCGGCTTCGG 
10 ACAAAGAGGGTGACCCGCACAGCCATTGCCATCTGTCTGGTCTrcnTGTG 
TGCTGGGCGCCCTACTACGTGCTGCAGCTGACCCAGTTGTCCATCAGCCGC 
CCGACCCTCACATTCGTCTACCTGTACAATGCGGCCATCAGCTTGGGCTAT 
GCCAACAGCTGCCTCAATCCCTTTGTGTACATAGTACTCTGTGAGACCTTT 
CGAAAACGCTTGGTGCTGTCGGTGAAGCCCGCGGCCCAGGGGCAGCTTCG 

15 CACGGTCAGCAATGCTCAGACAGCTGACGAGGAGAGGACAGAAAGCAAA 
GGCACCgtcgacggtaccgcgggcccgggatccatcgccaccATGGTGAGCAAGGGCGAGGA 
GCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGAGGTAA 
ACGGCCACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTAC 
GGCAAGCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCC 

20 CTGGCCCACCCTCGTGAGCACCCTGACCTACGGCGTGCAGTGCTTCAGCC 
GCTACCCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCC 
GAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACTA 
CAAGACCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGC 
ATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGC 

25 ACAAGCTGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCGAC 
AAGCAGAAGAACGGCATCAAGGTGAACTTCAAGATCCGCCACAACATCG 
AGGACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCCATC 
GGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCAGTC 
CGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGG 

30 AGTTCGTGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAG 

TAA 

gxam ple 2: Generation of Ch imeric and Fusion Proteins 

DNA vectors encoding fusion proteins between a MCH-R receptor 
35 (MCH1R) and several different superbright variants of Green Fluorescent Protein 
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(GFP) were generated. GFP variants were fused either via a 12 amino acid linker: 
TCGACGGTACCGCGGGCCCGGGATCCATCGCCAGC (SEQ. ID. NO. 30), 
amino acid sequence: YDGTAGPGSIAT (SEQ. ID. NO. 3 1) (linker fusions) or 
directly to the C-terminus of MCH1R (direct fusions). 

5 

Mouse MCHlR-linker-GFP Variant Fu sion Constructs 
Initially, mouse MCH1R was fused in frame via the linker to Enhanced 
Green Fluorescent Protein (EGFP). MCH1R was PCR-amplified (95°C for 5 
minutes; 95°C for 30 seconds, 60°C for 45 seconds, 68°C for 3.5 minutes, for 15 

10 cycles; 68°C for 7 minutes) from a full-length mouse MCH1R genomic DNA lambda 
clone utilizing a high fidelity polymerase mix (Expand High Fidelity PCR System 
from Boehringer Mannheim) and PCR primers [MCH1R (Eco RI) 5': 
GCGAATTCACCATGGATCTGCAAGCCTCG (SEQ. ID. NO. 32), MCH1R (Sal I) 
3': GCGTCGACGGTGCCTTTGCTTTCTGTCC (SEQ. ID. NO. 33)] that generated 

15 Eco RI and Sal I enzymatic restriction sites at the N- and C-terminus, respectively. 
The MCH1R N-terminal PCR primer was also designed to introduce a Kozak 
consensus sequence for translation which contained an Nco I site (5'-ACCATGG-3'), 
and the MCH1R C-terminal PCR primer was also designed to eliminate the 
endogenous stop codon present in the mouse MCH1R gene. The resulting PCR 

20 product was phenol/chloroform extracted, restriction digested with Eco RI and S al I, 
gel purified, and subcloned in frame into the multicloning site of Clontech's pEGFP- 
N3 vector between Eco RI and Sal I sites. Several resulting clones for this construct 
were sequenced to identify a clone with an entirely correct nucleotide sequence. This 
clone was named mMCHlR-l-EGFP for mouse MCHlR-linker-EGFP. 

25 An approximately 760 bp Sal I to Not I fragment of mMCHlR-l-EGFP 

was excised, gel purified, and subcloned into the multicloning site of pBluescript 
(SK+) (Stratagene) between Sail and Not I sites. An approximately 710 bp Nco I to 
Bsr Gl fragment of EGFP was excised from the resulting pBluescript-EGFP vector 
and replaced with the corresponding Nco I to Bsr Gl fragment of either Emerald, 

30 Topaz, or W1B (other superbright GFP variants), which were excised from vectors 
pRSET-Emerald, pRSET-Topaz, and pRSET-WlB, respectively. pRSET-Emerald, 
pRSET-Topaz, and pRSET-WlB were obtained from Aurora Biosciences Co. Sal I to 
Not I fragments containing either Emerald, Topaz, or W1B were excised from the 
resulting pBluescript-Emerald, pBluescript-Topaz, and pBluescript-WTB vectors, 

35 respectively. Appropriate fragments were gel purified and subcloned into mMCHlR- 
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l-EGFP digested with Sal I and Not I, replacing the Sal I to Not I EGFP fragment with 
the corresponding Sal I to Not I fragment from either Emerald, Topaz, or W1B. 
Several clones for each construct were sequenced to confirm the presence of the 
appropriate GFP variant. The resulting vectors were named mMCHlR-l-Emerald, 
5 mMCHlR-l-Topaz, and mMCHlR-l-WlB for mouse MCHlR-linker-Emerald, 
mouse MCHlR-linker-Topaz, and mouse MCHlR-linker-WlB, respectively. 

Sflouse MCH1R/GFP Variant Direct Fusion Constructs 

A two step PCR strategy was employed to generate the direct fusion 

10 constructs. First, mouse MCH1R, EGFP, and Emerald were PCR-amplified from a 
full-length mouse MCH1R genomic DNA lambda clone, Clontech's pEGFP-N3 
vector and Aurora's pRSET-Emerald vector, respectively. Mouse MCH1R was 
PCR-amplified according to the previously mentioned conditions utilizing the same 
N-terminal PCR primer [MCH1R (Eco RI) 5': GCGAATTCACCATGGATCTGCA 

15 AGCCTCG (SEQ. ID. NO. 32)], but in this case a different C-terminal PCR pnmer 
was employed. The C-terminal PCR primer [MCH1R (EGFP/Emerald) 3': 
CCTTGCTC ACC ATGGTGCCTTTGCTTTCTGTCC (SEQ. ID. NO. 34)] eliminated 
the endogenous stop codon of mouse MCH1R as before and introduced a region of 
nucleotide sequence complementary to the nucleotide sequence of the N-terminus of 

M) EGFP and Emerald were PCR-amplified (95°C for 5 minutes; 95°C for 

30 seconds, 60°C for 45 seconds, 68°C for 1 .5 minutes, for 15 cycles; 68°C for 7 
minutes) separately with a high fidelity polymerase mix (Advantage HF-2 from 
Clontech) from their respective templates utilizing a common N-termmal PCR pnmer 

25 [EGFP/Emerald (MCH1R) 5':. CAGAAAGCAAAGGCACCATGGTGAGCAA 
GGGCGAGGAGC (SEQ. ID. NO. 35)] that generated a region of nucleotide 
sequence complementary to the C-terminus of mouse MCH1R and C-terminal PCR 
primers [EGFP 3': GGCGGATCCTCTAGAGTCGCGGCC (SEQ. ID. NO. 36), or 
Emerald (EGFP) 3': GCTCTAGAGTCGCGGCCGCTTACTTGTACAGCTCGTCC 

30 (SEQ. ID. NO. 37)] that generated a Not I site at the C-terminus. The resultmg PCR 
products were electrophoresed on an agarose gel and the appropriate fragments were 
gel purified. 

In a second PCR step, PCR reactions were set up between the 
previously generated mouse MCH1R and EGFP, or mouse MCH1R and Emerald 
35 PCR products. Following an initial 5 minute denaturation step at 95°C, two rounds of 
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thermocycling (95°C for 30 seconds, 60°C for 45 seconds, 68°C for 4 minutes) were 
performed in the absence of PCR primers. This allowed the mouse MCH1R and GFP 
variants to anneal at their complementary regions and to be filled in by the high 
fidelity polymerase mix (Expand High Fidelity PCR System from Boehringer 

5 Mannheim), yielding double stranded template DNA. 

Subsequently, the common N-terminal mouse MCH1R [MCH1R (Eco 
RI) 5': GCGAATTC ACC ATGGATCTGC AAGCCTCG (SEQ. ID. NO. 32)] and 
appropriate C-teiminal PCR primers [EGFP 3': GGCGGATCCTCTAGAGTC 
GCGGCC (SEQ. ID. NO. 36) or Emerald (EGFP) 3': GCTCTAGAGTCGCGG 

10 CCGCTTACTTGTACAGCTCGTCC (SEQ. ID. NO. 37)] were added to the 

reactions and thermocycling was continued for an additional fifteen cycles followed 
by a final extension at 68°C for 7 minutes. The resulting PCR products were 
phenol/chloroform extracted, restriction digested with Eco RI and Not I, 
electrophoresed on an agarose gel, and appropriate fragments were gel purified. 

1 5 These Eco RI to Not I fragments represent direct fusions between 

either mouse MCH1R and EGFP, or mouse MCH1R and Emerald. Clontech's 
pEGFP-N3 vector was restriction digested with Eco RI and Not I liberating an 
approximately 780 bp Eco RI to Not I EGFP fragment. This restriction digest was 
electrophoresed on an agarose gel and the approximately 3 .9 Kb pEGFP-N3 vector 

20 backbone was gel purified. Eco RI to Not I mouse MCH1R/EGFP or mouse 

MCHIR/Emerald direct fusion fragments were subcloned into the pEGFP-N3 vector 
backbone between Eco RI and Not I sites. Several resulting clones for each of these 
two constructs were sequenced to identify clones with correct nucleotide sequence; 
however, no clones with entirely correct nucleotide sequences were identified. 

25 Fortunately, several clones for each of the two constructs only had nucleotide 

mismatches in the intron region of mouse MCH1R, and therefore, were not expected 
to effect the functionality of the resulting fusion proteins. These clones were named 
mMCHlR/EGFP and mMCHlR/Emerald for mouse MCH1R/EGFP direct fusion and 
mouse MCHIR/Emerald direct fusion, respectively. 

30 

Human Short and Long Form/Mouse S pecies Chimeric 

MCHlR-linker-GFP Variant Fusion Constructs 
The initial mouse MCHlR-linker-GFP variant fusion constructs were 
modified to generate both human short form and human long form/mouse species 
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chimeric MCHlR-linker-GFP variant fusion constructs. An approximately 1 .7 kb 
Hind ffl to Bsp EI fragment of the mouse MCHIR gene containing exon 1, the intron, 
and 127 amino acids of exon 2 was excised from the various mouse MCHlR-lmker- 
GFP variant fusion constructs and replaced by either an approximately 470 bp Hind 
5 m to Bsp EI fragment from the wild-type human MCHIR short form or an 

approximately 670 bp Hind m to Bsp EI fragment from the wild-type human MCHIR 
long form. 

Several clones for each construct were sequenced to confirm the 
presence of the N-terminal region of either the human MCHIR short or long forms. 
0 These clones were named hshort/mMCHlR-1-GFP variant or hlong/mMCHlR-l-GFP 
variant for human short form/mouse species chimeric MCHlR-linker-GFP variant 
and human long form/mouse species chimeric MCHlR-linker-GFP variant, 
respectively. 

L5 ayua Dlel; FnnnHonalEv tluatjon nf MOTIF /GFP Variant Fusion Proteins 

Both HEK293 Aequorin (National Institutes of Health) and CHO 
mammalian cell lines were transiently transfected with the various MCH1R/GFP 
variant fusion constructs, as well as the appropriate control constructs. Transfection 
was performed using Lipofectamine 2000 (Gibco BRL) per the manufacturer 
20 recommended protocol. Approximately 48 hours after transfection cells were 

harvested, stimulated with various concentrations of human MCH, and assayed for 
either aequorin bioluminescence (HEK293 Aequorin cells) or cAMP production 
(CHO cells). Aequorin bioluminescence is a representative measure of intracellular 
Ca 2+ mobilization. cAMP production was measured with the Adenylyl Cyclase 
25 Activation FlashPlate Assay (NEN Life Science Products, Inc.). 

Following transient transfection of the mMCHlR-linker-EGFP 
construct (MCH-R-1-EGFP) into HEK293 Aequorin cells, the resulting fusion protein 
exhibited functional activity comparable to that of the wild-type human MCHIR short 
form (MCH-R wt). By this functional assay, the EC 50 value for mMCHl R-l-EGFP 
was nearly identical to that of the wild-type human short form receptor (Figure 1). 

Following transient transfections of the mMCHlR-l-EGFP and 
inMCHlPv/EGFP fusion constructs into CHO cells, the resulting fusion proteins 
exhibited functional activity comparable to that of the wild-type human MCHIR short 
form. By this functional assay, the EC 50 values for mMCHlR-l-EGFP and 
mMCHlR/EGFP were comparable to that of the wild-type human receptor (Table 1). 



30 
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Transient transfections with the corresponding Emerald constructs yielded similar 
results (data not shown). 



Table 1 





EC50 (nM) 


JRj^fptor 

W iiH-tvpe Human MCH1R Short Form 


2.166 


Mouse MCH1R/EGFP 


0.819 


j^nseMCHlR-l-EGFP 


% 1QQ 



10 



15 



Following transient transfections of the human short form/mouse 
species chimeric MCH1R-1-EGFP (HuShort/mMCHlR-l-EGFP) and human long 
form/mouse species chimeric MCH1R-1-EGFP (HuU.ng/mMCHlR-1-EGFP) 
constructs into HEK293 cells, the resulting fusion proteins exhibited functional 
activity comparable to that of the wild-type human MHC1R short and long forms, 
respectively By this functional assay, the EC 50 value for each fusion protems was 
nearly identical to that of the corresponding wild-type human receptor (Table 2). 

Table 2 



Receptor 

Wild-type Human MCH1R Short Form — 
jHuShort/mMCHlR-l-EGFP 


EC50 (nM) 

22.27 , 

19.54 . ■ 


Wild-type Human MCH1R Long Form 


196.7 , 


_HuLong/mMCHlR-l-EGFP Form 


217.5 . — — : — l 



20 



25 



Following transient transfections of the human short form/mouse 
species chimeric MCH1R-1-EGFP (HuShort/mMCHlR-l-EGFP) and human long 
form/mouse species chimeric MCH1R-1-EGFP (HuLong/mMCHlR-l-EGFP) 
constructs into CHO cells, the resulting fusion proteins exhibited functional activity 
comparable to or less than that of the wild-type human MHC1R short and long forms, 
respectively (Table 3). By this functional assay, the EC 5 0 value for the human short 
form/mouse species chimeric MCH1R-1-EGFP fusion protein was comparable to that 
of the corresponding wild-type human receptor, whereas, the human long form/mouse 
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species chimeric MCH1R-1-EGFP fusion protein had an EC 5 Q value approximately 
7.5-fold higher than that of its corresponding wild-type control. 



Table 3 



Receptor 


EC50 (nM) 


Wild-tvpe Human MCH1R Short Form 


1.029 


Wild-type Human MCH1R Long Form 


1.515 


HuShort/mMCHlR-l-EFGP 


1.565 


jTuLong/mMCHlR-l-EGFP 


11.580 1 



Transient expression of all the MCH1R/GFP valiant fusion proteins 
that underwent functional evaluation resulted in fluorescence primarily associated 
with the plasma membrane in both HEK293 and CHO cells (data not shown). Tins 
10 pattern of fluorescence is consistent with a predominant membrane associated 
localization. 

gv am ple 4: G yration of Stable Cell Lines 

Wild-type CHO cells were transfected using SuperFect (Qiagen) and 

15 either mouse MCH-1R-EGFP or human short/mouse species chimeric MCH-1R- 
EGFP Forty-eight hours after transfection, transfected cells were subjected to 
positive selection for approximately ten days in media containing G418. Following 
selection MCH-1R-EGFP expressing CHO cells were bulk sorted by Fluorescence 
Assisted Cell Sorting (FACS) for one or two rounds on the basis of fluorescence 

20 intensity to increase the population of cells expressing EGFP. Following bulk sorts, 
individual clones of varying fluorescence intensities were isolated by FACS and 

expanded. .._.■« 
Fluorometric Microvolume Assay Technology (FMAT) was initially 

employed to screen a large number of stable clones by whole cell binding with a 
25 fluorescently labeled MCH derivative (SymJz-MCH, PE Biosystems) to idenffy those 
clones with good specific binding windows. Several clones exhibiting specific 
binding windows greater than 3-fold were further evaluated for MCH binding with the 
SPA-based Binding Assay. Cells from individual clones were dissociated m enzyme 
• free dissociation media and cell membranes were prepared and subsequently tested for 
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their ability to bind [ 125 I]Phe ,3 Tyr 19 -MCH in the presence of human MCH. CHO cell 
lines expressing either mouse MCH-1R-EGFP or human short/mouse species 
chimeric MCH-1R-EGFP (Figure 4) displayed IC50 values with MCH that were 
indistinguishable from the corresponding IC50 values obtained with a CHO cell lme 

5 expressing the wild-type human short isofonn of MCH-1R. 

The functional activity of these clones was evaluated with the cAMP 
Flashplate Assay (Figures 2 and 3). CHO cell lines expressing either mouse MCH- 
1R-EGFP (Figure 2) or human short/mouse species chimeric MCH-1R-EGFP (Figure 
3) displayed EC50 values with human MCH that were indistinguishable from the 

10 EC50 value obtained with a CHO cell line expressing the wild-type human short 

isoformof MCH-1R. 

The subcellular localization of the MCH-1R-EGFP fusion proteins 
were determined by confocal microscopy utilizing EGFP fluorescence as a marker for 
MCH-1R expression. CHO cell lines stably expressing either mouse MCH-1R-EGFP 
15 or human short/mouse species chimeric MCH-1R-EGFP displayedEGFP 

fluorescence primarily associated with the plasma membrane, demonstrating that 
these MCH-1R-EGFP fusion proteins are primarily associated with the plasma 
membrane. 

Other embodiments are within the following claims. While several 
,0 embodiments have been shown and described, various modifications may be made 

without departing from the spirit and scope of the present invention. 
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WHAT IS CLAIMED IS: 



1. A fusion protein comprising: 

a) a melanin concentrating hormone receptor polypeptide 
region comprising a sequence selected from the group consisting of: SEQ. ID. NO. 1, 
SEQ. E). NQ. 2, SEQ. SEQ. ID. NO. 3, SEQ. ID. NO. 4, and SEQ. ID. NO. 5; and 

b) a fluorescent polypeptide region joined directly, or 
though a linker, to the carboxy side of said melanin concentrating hormone receptor 
polypeptide region. 

2. The protein of claim 1 , wherein said fluorescent polypeptide 
region consists of an amino acid sequence selected from the group consisting of SEQ. 
ID. NO. 6, SEQ. ID. NO. 7, SEQ. ID. NO. 8, SEQ. ID. NO. 9, and SEQ. ID. NO. 10. 

3 T i ie protein of claim 2, wherein said melanin concentrating 
hormone polypeptide region consists of a sequence selected from the group consisting 
of: SEQ. ID. NO. 1, SEQ. ID. NO. 2, SEQ. ID. NO. 3, SEQ. ID. NO. 4, and SEQ. ID. 
NO. 5. 



10 



20 



4. The protein of claim 3, wherein said protein consists essentially 
of said melanin concentrating hormone receptor polypeptide region and said 
fluorescent polypeptide region. 

5. The protein of claim 4, wherein said protein consists of the 
25 aminoacidsequenceofSEQ.ID.NO.llorSEQ.ID.NO.12. 

6. The protein of claim 1 , wherein said melanin concentrating 
hormone polypeptide region is a chimeric polypeptide comprising (a) an MCH 
binding region from a first species and (b) a transmembrane and intracellular domain 

30 region from a second species joined directly, or though a linker, to the carboxy side of 
said MCH binding region. 

7. The protein of claim 6, wherein said fluorescent polypeptide 
region consists of an amino acid sequence selected from the group consisting of: SEQ. 

35 ID. NO. 6, SEQ. ID. NO. 7, SEQ. ID. NO. 8, SEQ. ID. NO. 9, and SEQ. ID. NO. 10. 
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8. The protein of claim 7, wherein said protein consists of the 
amino acid sequence of SEQ. ID. NO. 13 or SEQ. ID. NO. 14. 

9. A chimeric melanin concentrating hormone protein comprising: 

a) a melanin concentrating hormone binding region 

characteristic of a human melanin concentrating hormone receptor; 

b) a transmembrane domain characteristic of anon-human 

melanin concentrating hormone receptor; and 

c) an intracellular domain characteristic of a non-human 

melanin concentrating hormone receptor. 



10. The protein of claim 9, wherein said protein comprises a 
melanin concentrating hormone receptor polypeptide having a sequence similarity of 
15 at least 75% with either SEQ. ID. NO. 4 or SEQ. ID. NO. 5. 

1 1 The protein of claim 10, wherein said protein comprises the 
sequence of SEQ. ID. NO. 4 or SEQ. ID. NO. 5. 



20 



12. The protein of claim 1 1 , wherein said protein consists of the 
sequence of SEQ. ID. NO. 4 or SEQ. ID. NO. 5. 

13. A nucleic acid comprising a nucleotide sequence encoding for 
the protein of claim 1. 



25 



14. The nucleic acid of claim 13, wherein said nucleotide sequence 
is a contiguous sequence. 

15 The nucleic acid of claim 13, wherein said nucleotide sequence 
30 is selected from the group consisting of SEQ. ID. NO. 26, SEQ. ID. NO. 27, SEQ. ID. 
NO. 28 and SEQ. ID. NO. 29. 

16. A nucleic acid comprising a nucleotide sequence encoding for 
the protein of claim 9. 

35 
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17. The nucleic acid of claim 1 6, wherein said nucleotide sequence 
is a contiguous sequence. 

18. The nucleic acid of claim 16, wherein said nucleotide sequence 
5 is selected from the group consisting of SEQ. ID. NO. 19 and SEQ. ID. NO. 20. 

19. An expression vector comprising the nucleic acid of claim 13. 

20. An expression vector comprising the nucleic acid of claim 16. 

21 . A recombinant cell comprising the nucleic acid of claim 13. 

22. The recombinant cell of claim 21, wherein said nucleic acid is 
present in an expression vector. 

23. The recombinant cell of claim 21, wherein said nucleic acid is 
present in the genome of said cell. 

24. A recombinant cell comprising the nucleic acid of claim 16. 

25. The recombinant cell of claim 24, wherein said nucleic acid is 
present in an expression vector. 

26. The recombinant cell of claim 24, wherein said nucleic acid is 
25 present in the genome of said cell. 

27. A non-human transgenic animal comprising the nucleic acid of 

claim 13. 

28. A non-human transgenic animal comprising the nucleic acid of 



15 



20 



30 



claim 16. 



29 . A method for assaying for melanin concentrating hormone 
receptor active compounds comprising the steps of: 
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a) contacting the cell of claim 21 with a test preparation 

comprising one or more test compounds; and 

b) measuring the effect of said test preparation on one or 

more melanin concentrating hormone receptor activities. 

30. A method for assaying for melanin concentrating hormone 

receptor active compounds comprising the steps of: 

a) contacting the cell of claim 24 with a test preparation 

comprising one or more test compounds; and 

b) measuring the effect of said test preparation on one oi 

more melanin concentrating hormone receptor activities. 
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SEQUENCE LISTING 

<110> Merck & Co . , Inc. 

<12 0> MELANIN CONCENTRATING HORMONE RECEPTOR 
CHIMERIC AND FUSION PROTEINS 

<130> 20652 PCT 

<150> 60/189,698 
<151> 2000-03-15 

<160> 37 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 422 
<212> PRT 
<213> Human 

Me2°Ser Val Gly Ala Met Lys Lys Gly Val Gly Arg Ala Val Gly Leu 

G ly Gly Gly Ser Gly Cys Gin Ala Thr Glu Glu Asp Pro Leu Pro Asn 

Cys Gly Ala lys Ala Pro Gly Gin Gly Gly Arg Arg Trp Arg Leu Pro 

Gin Pro Ala Trp. Val Glu Gly Ser Ser Ala Arg Leu Trp Glu Gin Ala 

Thr Sy Thr Gly Trp Met Asp Leu Glu Ala Ser Leu Leu Pro Thr Gly 

Pro Asn Ala Ser Asn Thr Ser Asp Gly Pro Asp Asn Leu Thr Ser Ala 
85 



Gly Ser Pro Pro Arg Thr Gly Ser lie Ser Tyr He Asn lie lie Met 

Leu Leu Gly He He 
Il5 _ 120 125 



Pro Ser Val Phe Gly Thr lie Cys Leu Leu Gly He lie Gly Asn Ser 



Thr Val lie Phe Ala Val Val Lys Lys Ser Lys Leu His Trp Cys Asn 
Asn HI Pro Asp He Phe He He Asn Leu Ser Val Val Asp Leu Leu 
«£ Leu Leu Gly Met Pro Phe Met He His Gin Leu Met Gly Asn Gly 
Val Trp His Phe III Glu Thr Met Cys Thr Leu He Thr Ala Met Asp 
Ala Asn Ser Gin Phe Thr Ser Thr Tyr He Leu Thr Ala Met Ala He 
AS p Arg iyr Leu Ala Thr Val III Pro He Ser Ser Thr Lys Phe Arg 
Lys pro Ser Val Ala Thr Leu Val He Cys Leu Leu Trp Ala Leu Ser 
III He Ser He Thr Pro Val Trp Leu Tyr Ala Arg Leu He Pro Phe 
Pro Gly Gly Ala Val Gly Cys Gly lie Arg Leu Pro Asn Pro Asp Thr 

•per) 2 6-3 - 

Asp Leu Tyr Trp Phe Thr Leu Tyr Gin Phe Phe Leu Ala Phe Ala Leu 
Pro Phe Sal Val He Thr Ala Ala Tyr Val Arg He Leu Gin Arg Met 
290 295 
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Thr Ser Ser Val Ala Pro Ala Ser Gin Arg Ser lie Arg Leu Arg Thr 
lyl Arg Val Thr Arg Tnr Ala lie Ala lie Cys Leu Val Phe Phe Val 
Cys Trp Ala Pro Tyr Tyr Val Leu Gin Leu Thr Gin Leu Ser He Ser 

y 340 3 " 5 



Gly Tyr Ala Asn Ser Cys Leu Asn Pro Phe Val Tyr lie Val Leu Cys 

^ y 375 380 

" " Arg Leu Val Leu Ser Val 

390 395 



Glu Thr Phe Arg Lys Arg Leu Val Leu Ser Val Lys Pro Ala Ala Gin 



Ara Pro Thr Leu Thr Phe Val Tyr Leu Tyr Asn Ala Ala He Ser Leu 
355 360 
Tyr Ala Asn Ser Cys Leu 
370 375 
Thr Phe Arg Lys Arg Leu 

390 - 

HI Gin Leu Arg Ala Val Ser Asn Ala Gin Thr Ala Asp Glu Glu Arg 

y 405 410 

Thr Glu Ser Lys Gly Thr 
420 

<210> 2 
<211> 353 
<212> PRT 
<213> Human 

Me2°Asp Leu Glu Ala Ser Leu Leu Pro Thr Gly Pro Asn Ala Ser Asn 

Tnr Ser Asp Gly Pro Asp Asn Leu Thr Ser Ala Gly Ser Pro Pro Arg 

Thr Gly Ser lie Ser Tyr lie Asn He lie Met Pro Ser Val Phe Gly 

Thr He Cys Leu Leu Gly He lie Gly Asn Ser Thr Val He Phe Ala 

val Val Lys Lys Ser Lys Leu His Trp Cys Asn Asn Val Pro Asp He 

Phe He He Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu Gly Met 

85 90 
Pro Phe Met lie His Gin Leu Met Gly Asn Gly Val Trp His Phe Gly 



Glu Thr Met Cys Thr Leu He Thr Ala Met Asp Ala Asn Ser Gin Phe 



115 120 125 



Thr Ser Thr Tyr He Leu Thr Ala Met Ala He Asp Arg Tyr Leu Ala 

Ser Thr Lys Phe Arg 
150 ' 155 



Thr- Val His Pro He Ser Ser Thr Lys Phe Arg Eys Pro Ser Val Ala 



l£ Leu Val lie Cys Leu Leu Trp Ala Leu Ser Phe He Ser He Thr 

Pro Val Trp Leu Tyr Ala Arg Leu lie Pro Phe Pro Gly Gly Ala Val 

180 185 ■ L ^ u 

Glv Cys Gly He Arg Leu Pro Asn Pro Asp Thr Asp Leu Tyr Trp Phe 

Thr Leu Tyr Gin Phe Phe Leu Ala Phe Ala Leu Pro Phe Val Val He 

Thr HI Ala Tyr Val Arg III Leu Gin Arg Met Thr Ser Ser Val Ala 

III Ala Ser Gin Arg Ser He Arg Leu Arg Thr Lys Arg Val Thr Arg 

245 250 " 3 

Thr Ala He Ala He' Cys Leu Val Phe Phe Val Cys Trp Ala Pro Tyr 
inr no 265 27 0 

Tvr val Leu Gin Leu Thr Gin Leu Ser He Ser Arg Pro Thr Leu Thr 

y 975 280 0 

Phe Val ^yr Leu Tyr Asn Ala Ala He Ser Leu Gly Tyr Ala Asn Ser 
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Cys Leu Asn Pro Phe Val Tyr lie Val Leu Cys Glu Thr Phe Arg Lys 
Arg Leu Val Leu Ser VaS Lys Pro Ala Ala Gin Gly Gin Leu Arg Ala 
Val Ser Asn Ala III Thr Ala Asp Glu Glu Arg Thr Glu Ser Lys Gly 
Thr 



<210> 3 
<211> 353 
<212> PRT 
<213> Mouse 



Me^Asp Leu Gin Ala Ser Leu Leu Ser Thr Gly Pro Asn Ala Ser Asn 
iL Ser Asp Gly Gin Asp Asn Phe Thr Leu Ala Gly Pro Pro Pro Arg 
Thr Arg Ser Val Ser Tyr He Asn He lie Met Pro Ser Val Phe Gly 
Thr lie Cys Leu Leu Gly lie Val Gly Asn Ser Thr Val lie Phe Ala 
val Val Lys Lys Ser Lys Leu His Trp Cys Ser Asn Val Pro Asp lie 
Phe lie He Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu. Gly Met 



Pro Phe Met He His Gin Leu Met Gly Asn Gly Val Trp His Phe Gly 
Glu Thr Met Cys Thr Leu He Thr Ala Met Asp Ala Asn Ser Gin Phe 
Thr Ser Thr Tyr lie Leu Thr Ala Met Ala lie Asp Arg Tyr Leu Ala 
Thr VaJ His Pro He Ser Ser Thr Lys Phe Arg Lys Pro Ser Met Ala 



Thr Leu Val He Cys Leu Leu Trp Ala Leu Ser Phe He Ser lie Thr 

Tyr Ala Arg Leu lie 
185 

Arg Leu Pro Asn Pro 
200 

Phe Phe Leu Ala Phe 
215 

Val Lys lie Leu Gin 
230 

Arg Ser He Arg Leu 
245 

He Cys Leu Val Phe 
Tyr val Leu Mn Leu Thr Gin Leu Ser lie Ser Arg Pro Thr Leu Thr 



Pro Val Trp Leu Tyr Ala Arg Leu lie Pro Phe Pro Gly Gly Ala Val 



Gly cys Gly HI Arg Leu Pro Asn Pro Asp Thr Asp Leu Tyr Trp Phe 
Thr Leu Tyr Gin Phe Phe Leu Ala Phe Ala Leu Pro Phe Val Val He 
Thr III Ala Tyr Val Lys He Leu Gin Arg Met Thr Ser Ser Val Ala 
III Ala Ser Gin Arg ler He Arg Leu Arg Thr Lys Arg Val Thr Arg 
Thr Ala He Ala 111 Cys Leu Val Phe Phe Val Cys Trp Ala Pro Tyr 



Phe Val Tyr Leu Tyr Asn Ala Ala He Ser Leu Gly Tyr Ala Asn Ser 

Cys Leu Asn Pro Phe Val Tyr He Val Leu Cys Glu Thr Phe Arg Lys 

HI L eu val Leu Ser III Lys Pro Ala Ala Gin Gly Gin Leu Arg Thr 

Val Ser Asn Ala III Thr Ala Asp Glu Glu Arg Thr Glu Ser Lys Gly 
340 

Thr 
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<210> 4 
<211> 353 
<212> PRT 

<213> Artificial Sequence 

<223> Human short form/mouse species chimeric MCH1R 

Me?°Asp Leu Glu Ala Ser Leu Leu Pro Thr Gly Pro Asn Ala Ser Asn 

Thr Ser Asp Gly Pro Asp Asn Leu Thr Ser Ala Gly Ser Pro Pro Arg 

Thr Gly Ser lie Ser Tyr lie Asn He lie Met Pro Ser Val Phe Gly 

Thr lie Cys Leu Leu Gly He lie Gly Asn Ser Thr Val lie Phe Ala 

Val Val Lys Lys Ser Lys Leu His Trp Cys Asn Asn Val Pro Asp lie 

Phe He He Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu Gly Met 

Pro Phe Met He Sis Gin Leu Met Gly Asn Gly Val Trp His Phe Gly 

Glu Thr Met Cys Thr Leu He Thr La Met Asp Ala Asn Ser Gin Phe 

115 120 lz:> „ 

Thr Ser Thr Tyr He Leu Thr Ala Met Ala He Asp Arg Tyr Leu Ala 

1 *a n 135 j-^*u 

Thr Val His Pro He Ser Ser Thr Lys Phe Arg Lys Pro Ser Met Ala 

Thr Leu Val He Cys Leu Leu Trp Ala Leu Ser Phe He Ser lie Thr 

Pro Val Trp Leu Tyr Ala Arg Leu He Pro Phe Pro Gly Gly Ala Val 

Gly Cys Gly He Arg Leu Pro Asn Pro Asp Thr Asp Leu Tyr Trp Phe 

Thr Leu Tyr Gin Phe Phe Leu III Phe Ala Leu Pro Phe Val Val He 

Thr III Ala Tyr Val Lys He Leu Gin Arg Met Thr Ser Ser Val Ala 

Pro Ala Ser Gin Arg Ser lie Arg Leu Arg Thr Lys Arg Val Thr Arg 

245 250 
Thr Ala He Ala lie Cys Leu Val Phe Phe Val Cys Trp Ala Pro Tyr 

Tyr val Leu Gin" Leu Thr Gin Leu Ser lie Ser Arg Pro Thr Leu Thr 

275 280 285 

Phe Val Tyr Leu Tyr Asn Ala Ala lie Ser Leu Gly Tyr Ala Asn Ser 

Cys llu Asn Pro Phe Val HI lie Val Leu Cys Glu Thr Phe Arg Lys 

irg Leu Val Leu Ser HI Lys Pro Ala Ala Gin Gly Gin Leu Arg Thr 

Val Ser Asn Ala Gin Thr Ala Asp Glu Glu Arg Thr Glu Ser Lys Gly 
340 345 

Thr 

<210> 5 
<211> 422 
<212> PRT 

<213> Artificial Sequence 
<220> 
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<223> Human long form/mouse species chimeric MCH1R 

Metier Val Gly Ala Met Lys Lys Gly Val Gly Arc, Ala Val Gly Leu 
eJy Gly Gly Ser G^y Cys Gin Ala Thr Glu Glu Asp Pro Leu Pro Asn 
Cys Gly Ala Cys Ala Pro Gly Gin Gly Gly Arg Arg Trp Arg Leu Pro 
Gin Pro Ala Trp Val Glu Gly Ser Ser Ala Arg Leu Trp Glu Gin Ala 
Thr G?y Thr Gly Trp Met Asp Leu Glu Ala Ser Leu Leu Pro Thr Gly 



Pro Asn Ala Ser Asn Thr Ser Asp Gly Pro Asp Asn Leu Thr Ser Ala 
GLy ser Pro Pro Arg Thr Gly Ser lie Ser Tyr lie Asn lie He Met 



Thr Val He Phe Ala Val Val Lys Lys Ser Lys Leu His Trp Cys Asn 



140 



Pro Ser Val Pne Gly Thr He Cys Leu Leu Gly lie lie Gly Asn Ser 

Ala Val Val 
135 

He Phe He 
150 

Met Pro Phe 
165 

Gly Glu Thr 

Ala Asn Ser Oil Phe Thr Ser Thr Tyr lie Leu Thr Ala Met Ala lie 



Phi Leu Leu Gly Met Pro Phe Met He His Gin Leu Met Gly Asn Gly 
Val Trp His Phe Gly Glu Thr Met Cys Thr Leu lie Thr Ala Met Asp 



Asn Val Pro Asp He Phe He He Asn Leu Ser Val Val Asp Leu Leu 

Phe Met He His 
170 

Thr Met Cys Thr 
185 

Ser Thr Tyr He 
200 

Val His Pro He 
215 

Leu Val He Cys 

III He Ser He Thr Pro Val Trp Leu Tyr All Arg Leu He Pro Phe 



Asp Arg Tyr Leu Ala Thr Val His Pro He Ser Ser Thr Lys Phe Arg 
Lys Pro Ser Met Ala Thr Leu Val He Cys Leu Leu Trp Ala Leu Ser 



235 

OAS - 250 

Pro Gly Gly Ala Val Gly Cys Gly He Arg Leu Pro Asn Pro Asp Thr 
2 60 2 0 3 

Tyr Trp Phe Thr Leu Tyr 
275 280 
Val Val He Thr Ala Ala 

Thr Ser Ser Val Ala Pro III Ser Gin Arg Ser He Arg Leu Arg Thr 



Asp Leu Tyr Trp Phe Thr Leu Tyr Gin Phe Phe Leu Ala Phe Ala Leu 
Pro Phe HI Val He Thr Ala Ala Tyr Val Lys He Leu Gin Arg Met 



ni n '315 
III Arg Val Thr Arg Thr Ala He Ala He Cys Leu Val Phe Phe Val 



cys Trp Ala Pro Tyr Tyr Val Leu Gin Leu Thr Gin Leu Ser He Ser 

Arg Pro Thr Leu Thr Phe Val Tyr Leu Tyr Asn Ala Ala He Ser Leu 

Giy T yr III Asn Ser Cys Leu Asn Pro Phe Val Tyr He Val Leu Cys 

Glu Thr Phe Arg Lys Arg Leu Val Leu Ser Val Lys Pro Ala Ala Gin 

HI Gin Leu Arg Thr III Ser Asn Ala Gin Thr Ala Asp Glu Glu Arg 

* . 405 410 

Thr Glu Ser Lys Gly Thr 
420 

<210> 6 
<211> 238 
<212> PRT 

<213> Aequorea Victoria 
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olu Leu Asp Gly Asp Val Asn Gly His £ys Phe Ser. Val Ser Gly Glu 
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys 



Me?°Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val 

Ly 
Le 

Thr Thr lly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe 
Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 
Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
Lys Phe Glu G?y Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 
ASP Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr Asn 
Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn Gly 
HI \,ys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val 
Gin Leu Ala Asp J" Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 



Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 



200 205 



Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 

■ X. D 



Thr III Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys 
225 230 

<210> 7 
<211> 239 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> GFP derivative 

Me?°Va 7 l Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
val Glu Leu Asp c'ly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
Glu Gly Glu G°ly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
Cys Thr tL Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
L eu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
SL Hi. Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
Thr He Phe Phe Lys Asp Asp Gly Asn' Tyr Lys Thr Arg Ala Glu 



val Lys Phe G?u Gly Asp Thr Leu Vai Asn Arg He Glu Leu Lys Gly 



He Asp Pne Lys Glu Asp Gly "sn lie Leu Gly His £ys Leu Glu Tyr 



A sn Tyr Asn Ser His Asn HI Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 



-6- 



WO 01/68706 



PCT/US01/08071 



Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 235 

<210> 8 
<211> 239 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> GFP derivative 
<400> 8 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

! 5 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

35 " 40 45 

Cvs Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 
65 " 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

Arcr Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 HO 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

130 135 140 

Asn Tyr Asn Ser His Lys Val Tyr He Thr Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser 

165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

c er Lvs Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe . 

210 215 " 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 235 

<210> 9 
<211> 239 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> GFP derivative 

Met°Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
l 5 10 15 
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Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 

35 40 45 

Cvs Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

Phe Gly Tyr Gly Val Gin Cys Phe Ala Arg Tyr Pro Asp His Met Arg 
65 70 75 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 HO 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

H5 120 t 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 

145 150 155 , I 

Glv He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 235 

<210> 10 
<211> 239 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> GFP derivative 
<400> 10 

Ser Lys Gly Glu Glu Leu pne __ - 
x 5 10 15 

i m.. To,, A en \r Acsr'i Val Asn _ 

25 30 
Thr Tyr Gly Lys Leu Thr Leu Lyj 

40 45 
Pro Val Pro Trp Pro Thr Leu Va. 
50 55 60 

- Trp Gly Val Gin Cys Phe Ser Arg. Tyr Pre 
65 70 75 80 

- — t^u« t™ A i a Met Pro Glu Gly Tyr Val Glr 

90 95 
Asp Gly Asn Tyr Lys Thr Arg Ale 

105 HO. 
Leu Val Asn Arg He Glu Leu Lys 
120 125 

... , _„ Asn He Leu Gly His Lys Leu Gl\ 

?30 ■ 135 140 

Ala Asp -uys <aJ-n uye 

145 ^>u 155 „ 160 

' - »j- t™ K^rr H is Asn He Glu Asp Gly 

170 175 

Asn Thr Pro lie Gly Asp 

180 " 185 190 



Ser 


Lys 


Gly 
5 


Glu 


Leu 


Asp 


Gly 


Asp 




20 






Glu 


Gly 


Asp 


Ala 


35 








Thr 


Gly 


Lys 


Leu 


Trp 


Gly 


Val 


Gin 








70 


Asp 


Phe 


Phe 


Lys 




85 




He 


Phe 


Phe 


Lys 




100 






Phe 


Glu 


Gly 


Asp 


115 








Phe 


Lys 


Glu 


Asp 


He 


Ser 


His 


Asn 








150 


Lys 


Ala 


His 


Phe 




165 




Leu 


Ala 


Asp 


His 
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Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

210 215 220 

Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 235 

<210> 11 
<211> 604 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mouse MCH1R-1 inker -EGFP 
<400> 11 

.on mn Ala Ser Leu Leu Ser Thr Gly Pro Asn Ala Ser Asn 

10 15 
Thr Leu Ala Gly Pro Pro Pro Arg 
25 30 
lie He Met Pro Ser Val Phe Gly 
45 

Gly Asn Ser Thr Val He Phe Ala 
60 

Trp Cys Ser Asn Val Pro Asp He 

75 80 
Asp Leu Leu Phe Leu Leu Gly Met 

90 95 
Gly Asn Gly Val Trp His Phe Gly 
105 HO 
Ala Met Asp Ala Asn Ser Gin Phe 
125 

Met Ala He Asp Arg Tyr Leu Ala 
140 

Lys Phe Arg Lys Pro Ser Met Ala 
155 160 
Ala Leu Ser Phe lie Ser He Thr 

170 175 
He Pro Phe Pro Gly Gly Ala Val 
185 190 
Pro Asp Thr Asp Leu Tyr Trp Phe 
205 

Phe Ala Leu Pro Phe Val Val He 
220 

Gin Arg Met Thr Ser Ser Val Ala 
235 240 
Leu Arg Thr Lys Arg Val Thr Arg 

250 255 
Phe Phe Val Cys Trp Ala Pro Tyr 
265 270 
Ser He Ser Arg Pro Thr Leu Thr 
285 

He Ser Leu Gly Tyr Ala Asn Ser 
300 

Val Leu Cys Glu Thr Phe Arg Lys 
315 320 
Ala Ala Gin Gly Gin Leu Arg Thr 

330 335 
Glu Glu Arg Thr Glu Ser Lys Gly 
345 350 
Gly Ser He Ala Thr Met Val Ser 
3 65 



Met 


Asp 


Leu 


Gin 


Ala 
5 


Ser 


Leu 


Leu 


1 

He 


Ser 


Asp 


Gly 


Gin 


Asp 


Asn 


Phe 








20 










Thr 


Arg 


Ser 


Val 


Ser 


Tyr 


He 


Asn 




35 










40 


Thr 


lie 


Cys 


Leu 


Leu 


Gly 


lie 


Val 




50 










55 


His 


Val 


Val 


Lys 


Lys 


Ser 


Lys 


Leu 


65 










70 




Val 


Phe 


He 


He 


Asn 


Leu 


Ser 


Val 








85 








Pro 


Phe 


Met 


He 


His 


Gin 


Leu 


Met 








100 










Glu 


Thr 


Met 


Cys 


Thr 


Leu 


He 


Thr 




115 










120 


Thr 


Ser 


Thr 


Tyr 


He 


Leu 


Thr 


Ala 




130 










135 




Thr 


Val 


His 


Pro 


He 


Ser 


Ser 


Thr 


145 










150 






Thr 


Leu 


Val 


He 


Cys 


Leu 


Leu 


Trp 










165 








Pro 


Val 


Trp 


Leu 


Tyr 


Ala 


Arg 


Leu 








180 










Gly 


Cys 


Gly 


He 


Arg 


Leu 


Pro 


Asn 


195 










200 


Thr 


Leu 


Tyr 


Gin 


Phe 


Phe 


Leu 


Ala 


210 








215 




Thr 


Ala 


Ala 


Tyr 


Val 


Lys 


He 


Leu 


225 










230 






Pro 


Ala 


Ser 


Gin 


Arg 


Ser 


He 


Arg 










245 








Thr 


Ala 


He 


Ala 


He 


Cys 


Leu 


Val 








260 










Tyr 


Val 


Leu 


Gin 


Leu 


Thr 


Gin 


Leu 




275 










280 


Phe 


Val 


Tyr 


Leu 


Tyr 


Asn 


Ala 


Ala 




290 










295 




Cys 


Leu 


Asn 


Pro 


Phe 


Val 


Tyr 


lie 


305 










310 






Arg 


Leu 


Val 


Leu 


Ser 


Val 


Lys 


Pro 








325 








Val 


Ser 


Asn 


Ala 


Gin 


Thr 


Ala 


Asp 






340 










Thr 


Val 


Asp 


Gly 


Thr 


Ala 


Gly 


Pro 




355 










360 
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Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val Glu Leu 

Asp Gly Asp Val Asn Gly III Lys Phe Ser Val Ser Gly Glu Gly Glu 

III Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys Thr Thr 

J 405 410 

Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr 

420 425 <±ju 

Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His Asp 

Phe Phe Lys Ser Ala Met Pro Gxu Gly Tyr Val Gin Glu Arg Thr lie 

Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe 

. - r 470 ft / 3 - 

Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly He Asp Phe 
Lys Glu Asp Gly ill He Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn 



Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly He Lys 
515 520 525 

Phe Lys He Arg His Asn He Glu Asp Gly Ser 
n 535 540 

Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val Leu 

Leu Pro Asp Asn His Sr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp 

565 57 0 3,J 

Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala 

580 585 
Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
595 600 

<210> 12 
<211> 592 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mouse MCH1R/EGFP 

M e2°Asp 2 Leu Gin Ala Ser Leu Leu Ser Thr Gly Pro Asn Ala Ser Asn 



Val Asn Pne Lys He Arg His Asn He Glu Asp Gly Ser Val Gin Leu 



lie Ser Asp Gly Gin Asp Asn Phe Thr Leu Ala Gly Pro Pro Pro Arg 



25 30 



Thr Arg Ser Val Ser Tyr He Asn He lie Met Pro Ser Val Phe Gly 



Thr He Cys Leu Leu Gly He Val Gly Asn Ser Thr Val He Phe Ala 



Val Val Lys Lys Ser Lys Leu His Trp Cys Ser Asn Val Pro Asp He 



60 



Phe He He Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu Gly Met 



75 



Pro Phe Met He Sis Gin Leu Met Gly Asn Gly Val Trp His. Phe Gly 

100 105 ■ L - LU 

Glu Thr Met Cys Thr Leu He Thr Ala Met Asp Ala Asn Ser Gin Phe 

Thr Ser Thr Tyr He Leu Thr Ala Met Ala He Asp Arg Tyr Leu Ala 

Thr vax His Pro lie Ser Ser Thr Lys Phe Arg Lys Pro Ser Met Ala 



90 95 



Thr Leu Val He Cys Leu Leu Trp Ala Leu Ser Phe He Ser He Thr 
165 170 
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Pro 


Val 


Trp 


Leu 


Tyr 


Ala 


Arg 


Leu He Pro Phe 




180 








185 


Glv Cys 


Gly 


He 


Arg 


Leu 


Pro 


Asn Pro Asp Thr 






195 










200 


Thr 


Leu 


Tvr 


Gin 


Phe 


Phe 


Leu 


Ala Phe Ala Leu 


210 








215 




Thr 


Ala 


Ala 


Tyr 


Val 


Lys 


He 


Leu Gin Arg Met 


225 










230 




235 


Pro 


Ala 


Ser 


Gin 


Arg 


Ser 


He 


Arg Leu Arg Thr 








245 






250 


Thr 


Ala 


lie 


Ala 


He 


Cys 


Leu 


Val Phe Phe Val 




260 








265 


Tvr Val 


Leu 


Gin 


Leu 


Thr 


Gin 


Leu Ser lie Ser 






275 










280 


Phe 


Val 


Tyr 


Leu 


Tyr 


Asn 


Ala 


Ala He Ser Leu 


290 










295 




Cys 


Leu 


Asn 


Pro 


Phe 


Val 


Tyr 


lie Val Leu Cys 


305 










310 




315 


Arg 


Leu 


Val 


Leu 


Ser 


Val 


Lys 


Pro Ala Ala Gin 








325 






330 


Val 


Ser 


Asn 


Ala 


Gin 


Thr 


Ala 


Asp Glu Glu Arg 




340 








345 


Thr 


Met 


Val 


Ser 


Lys 


Gly 


Glu 


Glu Leu Phe Thr 




355 










360 


Leu 


Val 


Glu 


Leu 


Asp 


Gly 


Asp 


Val Asn Gly His 


370 










375 


Thr Tyr Gly Lys 


Glv Glu 


Gly 


Glu 


Gly 


Asp 


Ala 


385 








390 




395 


He 


Cys 


Thr 


Thr 


Gly 


Lys 


Leu 


Pro Val Pro Trp 






405 






410 


Thr 


Leu 


Thr 


Tyr 


Gly 


Val 


Gin 


Cys Phe Ser Arg 






420 








425 


Lys 


Gin 


His 


Asp 


Phe 


Phe 


Lys 


Ser Ala Met Pro 




435 










440 


Glu Arg 


Thr 


He 


Phe 


Phe 


Lys 


Asp Asp Gly Asn 




450 










455 


Thr Leu Val Asn 


Glu 


Val 


Lys 


Phe 


Glu 


Gly 


Asp 


465 








470 




475 


Gly 


He 


Asp 


Phe 


Lys 


Glu 


Asp 


Gly Asn He Leu 








485 






490 


Tyr 


Asn 


Tyr 


Asn 


Ser 


His 


Asn 


Val Tyr He Met 






500 








505 


Asn Gly 


He 


Lys 


Val 


Asn 


Phe 


Lys lie Arg His 






515 










520 


Sef Val 


Gin 


Leu 


Ala 


Asp 


His 


Tyr Gin Gin Asn 




530 










535 


Asn His Tyr Leu 


Gly 
545 


Pro 


Val 


Leu 


Leu 


Pro 


Asp 










550 




555 


Leu 


Ser 


Lys 


Asp 


Pro 


Asn 


Glu 


Lys Arg Asp His 








565 






570 


Phe 


Val 


Thr 


Ala 


Ala 


Gly 


He 


Thr Leu Gly Met 






580 








585 



190 

Asp Leu Tyr Trp Phe 

205. 

Pro Phe Val Val He 
220 

Thr Ser Ser Val Ala 
240 

Lys Arg Val Thr Arg 
255 

Cys Trp Ala Pro Tyr 
270 

Arg Pro Thr Leu Thr 
285 

Gly Tyr Ala Asn Ser 
300 

Glu Thr Phe Arg Lys 
320 

Gly Gin Leu Arg Thr 
335 

Thr Glu Ser Lys Gly 
350 

Gly Val Val Pro He 
365 

Lys Phe Ser Val Ser 
380 

Leu Thr Leu Lys Phe 
400 

Pro Thr Leu Val Thr 
415 

Tyr Pro Asp His Met 
430 

Glu Gly Tyr Val Gin 
445 

Tyr Lys Thr Arg Ala 
460 

Arg He Glu Leu Lys 
480 

Gly His Lys Leu Glu 
495 

Ala Asp Lys Gin Lys 
510 

Asn He Glu Asp Gly 
525 

Thr Pro He Gly Asp 
540 

Ser Thr Gin Ser Ala 
560 

Met Val Leu Leu Glu 
575 

Asp Glu Leu Tyr Lys 
590 



<210> 13 
<211> 604 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> MCHlR-linker-EGFP 
<400> 13 
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Met Asp Leu Glu Ala Ser Leu Leu Pro Thr Gly Pro Asn Ala Ser Asn 

1 5 10 15 

Thr Ser Asp Gly Pro Asp Asn Leu Thr Ser Ala Gly Ser Pro Pro Arg 

20 25 30 

Thr Gly Ser He Ser Tyr He Asn He He Met Pro Ser Val Phe Gly 

35 40 45 

Thr He Cys Leu Leu Gly He He Gly Asn Ser Thr Val He Phe Ala 

50 55 60 

Val Val Lys Lys Ser Lys Leu His Trp Cys Asn Asn Val Pro Asp He 
65 70 75 80 

Phe He He Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu Gly Met 

85 90 95 

Pro Phe Met He His Gin Leu Met Gly Asn Gly Val Trp His Phe Gly 

100 105 HO 

Glu Thr Met Cys Thr Leu He Thr Ala Met Asp Ala Asn Ser Gin Phe 

115 120 125 

Thr Ser Thr Tyr He Leu Thr Ala Met Ala He Asp Arg Tyr Leu Ala 

130 135 140 

Thr Val His Pro He Ser Ser Thr Lys Phe Arg Lys Pro Ser Met Ala 
145 150 155 160 

Thr Leu Val He Cys Leu Leu Trp Ala Leu Ser Phe He Ser He Thr 

165 170 175 

Pro Val Trp Leu Tyr Ala Arg Leu He Pro Phe Pro Gly Gly Ala Val 

180 185 190 

Glv Cys Gly He Arg Leu Pro Asn Pro Asp Thr Asp Leu Tyr Trp Phe 

195 200 205 

Thr Leu Tyr Gin Phe Phe Leu Ala Phe Ala Leu Pro Phe Val Val He 

210 215 220 

Thr Ala Ala Tyr Val Lys He Leu Gin Arg Met Thr Ser Ser Val Ala 
225 230 235 240 

Pro Ala Ser Gin Arg Ser He Arg Leu Arg Thr Lys Arg Val Thr Arg 

245 250 255 

Thr Ala He Ala He Cys Leu Val Phe Phe Val Cys Trp Ala Pro Tyr 

260 265 270 

Tvr Val Leu Gin Leu Thr Gin Leu Ser He Ser Arg Pro Thr Leu Thr 

Y 275 280 285 

Phe Val Tyr Leu Tyr Asn Ala Ala He Ser Leu Gly Tyr Ala Asn Ser 

290 295 300 

Cvs Leu Asn Pro Phe Val Tyr He Val Leu Cys Glu Thr Phe Arg Lys 
305 310 315 320 

Ara Leu Val Leu Ser Val Lys Pro Ala Ala Gin Gly Gin Leu Arg Thr 

5 325 330 335 

Val Ser Asn Ala Gin Thr Ala Asp Glu Glu Arg Thr Glu Ser Lys Gly 

340 345 350 

Thr Val Asp Gly Thr Ala Gly Pro Gly Ser He Ala Thr Met Val Ser 

355 360 365 

Lvs Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val Glu Leu 

y 370 375 380 

Asd Glv Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu 
385 390 ~ 395 400 

Glv Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys Thr Thr 

• 405 410 415 

Glv Lvs Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr 

y 420 425 430 

Glv Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His Asp 

* 435 440 445 

Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr He 

450 455 460 

Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe 
465 470 475 480 

Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly He Asp Phe 
485 490 495 
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Lys 


Glu 


Asp 


vjxy 


Asn 


Tl e 

I-LC 


Leu 


Glv 






c rj n 










Ser 


His 


Asn 


Val 


Tyr 


He 


Met 


Ala 




515 










520 


Val 


Asn 


Phe 


Lys 


He 


Arg 


His 


Asn 


530 










535 




Ala 


Asp 


His 


Tyr 


Gin 


Gin 


Asn 


Thr 


545 








550 






Leu 


Pro 


Asp 


Asn 


His 


Tyr 


Leu 


Ser 








565 








Pro 


Asn 


Glu 


Lys 
580 


Arg 


Asp 


His 


Met 


Ala Glv 


lie 


Thr 


Leu 


Gly 


Met 


Asp 






595 










600 



<210> 14 
<211> 673 
<212> PRT 

<213> Artificial Sequence 



His Lys Leu Glu Tyr Asn Tyr Asn 

505 " 510 

Asp Lys Gin Lys Asn Gly lie Lys 

525 . 

He Glu Asp Gly Ser Val Gin Leu 
540 

Pro He Gly Asp Gly Pro Val Leu 
555 560 
Thr Gin Ser Ala Leu Ser Lys Asp 

570 575 
Val Leu Leu Glu Phe Val Thr Ala 
585 590 
Glu Leu Tyr Lys 



<220> 

<223> MCHlR-linker-EGFP 

Met°Ser 4 Val Gly Ala Met Lys Lys Gly Val Gly Arg Ala Val Gly Leu 

1 5 10 15 

Gly Gly Gly Ser Gly Cys Gin Ala Thr Glu Glu Asp Pro Leu Pro Asn 

20 25 30 

Cys Gly Ala Cys Ala Pro Gly Gin Gly Gly Arg Arg Trp Arg Leu Pro 

35 40 45 

Gin Pro Ala Trp Val Glu Gly Ser Ser Ala Arg Leu Trp Glu Gin Ala 

50 55 60 

Thr Gly Thr Gly Trp Met Asp Leu Glu Ala Ser Leu Leu Pro Thr Gly 

65 70 75 80 

Pro Asn Ala Ser Asn Thr Ser Asp Gly Pro Asp Asn Leu Thr Ser Ala 

85 90 95 

Glv Ser Pro Pro Arg Thr Gly Ser He Ser Tyr He Asn He He Met 

y 100 105 HO 

Pro Ser Val Phe Gly Thr He Cys Leu Leu Gly lie lie Gly Asn Ser 

115 120 125 

Thr Val He Phe Ala Val Val Lys Lys Ser Lys Leu His Trp Cys Asn 

130 135 140 

Asn Val Pro Asp He Phe He He Asn Leu Ser Val Val Asp Leu Leu 
145 150 155 160 

Phe Leu Leu Gly Met Pro Phe Met He His Gin Leu Met Gly Asn Gly 

165 170 175 

Val Trp His Phe Gly Glu Thr Met Cys Thr Leu He Thr Ala Met Asp 

180 185 190 

Ala Asn Ser Gin Phe Thr Ser Thr Tyr lie Leu Thr Ala Met Ala He 

195 200 205 

Asp Arg Tyr Leu Ala Thr Val His Pro He Ser Ser Thr Lys Phe Arg 

210 215 220 

Lvs Pro Ser Met Ala Thr Leu Val lie Cys Leu Leu Trp Ala Leu Ser 
205 230 235 ^ 240 

Phe He Ser He Thr Pro Val Trp Leu Tyr Ala Arg Leu He Pro Phe 

245 250 255 

Pro Gly Gly Ala Val Gly Cys Gly He Arg Leu Pro Asn Pro Asp Thr 

260 ■ 265 270 

Asp Leu Tyr Trp Phe Thr Leu Tyr Gin Phe Phe Leu Ala Phe Ala Leu 

275 280 285 

Pro Phe Val Val He Thr Ala Ala Tyr Val Lys He Leu Gin Arg Met 
290 295 300 
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Thr Ser Ser Val Ala Pro Ala Ser Gin Arg Ser He Arg Leu Arg Thr 
o 05 310 315 320 

Lvs Arg Val Thr Arg Thr Ala He Ala He Cys Leu Val Phe Phe Val 

y 325 330 335 

Cvs Trp Ala Pro Tyr Tyr Val Leu Gin Leu Thr Gin Leu Ser He Ser 

340 345 350 

Arcr Pro Thr Leu Thr Phe Val Tyr Leu Tyr Asn Ala Ala He Ser Leu 

355 360 365 

Gly Tyr Ala Asn Ser Cys Leu Asn Pro Phe Val Tyr He Val Leu Cys 

370 375 380 

Glu Thr Phe Arg Lys Arg Leu Val Leu Ser Val Lys Pro Ala Ala Gin 

Glv Gin Leu Arg Thr Val Ser Asn Ala Gin Thr Ala Asp Glu Glu Arg 

405 410 415 

Thr Glu Ser Lys Gly Thr Val Asp Gly Thr Ala Gly Pro Gly Ser He 

420 425 43 0 

Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro 

435 440 445 

He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val 

450 455 460 

Ser Glv Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys 
465 470 475 480 

Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val 

485 490 495 

Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His 

500 505 510 

Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val 

515 520 525 

Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg 

530 535 540 

Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu 
545 J 550 555 560 

Lvs Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu 

565 570 . 575 

Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin 

580 585 590 

Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp 

595 600 605 

Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly 

610 615 620 

aso Glv Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser 
625 630 635 640 

Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu 

645 650 655 

Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr 
660 665 670 

Lys 

<210> 15 
<211> 1269 
<212> DNA 
<213> Human 

<400> 15 

atatcagtgg gagccatgaa gaagggagtg gggagggcag ttgggcttgg aggcggcagc 
qgctgccagg ctacggagga agaccccctt cccaactgcg gggcttgcgc tccgggacaa 
qqtqgcaggc gctggaggct gccgcagcct gcgtgggtgg aggggagctc agctcggttg 
tqggagcagg cgaccggcac tggctggatg gacctggaag cctcgctgct gcccactggt 
cccaacgcca gcaacacctc tgatggcccc gataacctca cttcggcagg atcacctcct 
cacacgggga gcatctccta catcaacatc atcatgcctt cggtgttcgg caccatctgc 
ctcctgggca tcatcgggaa ctccacggtc atcttcgcgg tcgtgaagaa gtccaagctg 

- 14 - 



60 
120 
180 
240 
300 
360 
420 
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cactggtgca 
tttctcctgg 
ggggagacca 
tacatcctga 
acgaagttcc 
ttcatcagca 
gtgggctgcg 
cagtttttcc 
ctgcagcgca 
aagagggtga 
tactatgtgc 
ttatacaatg 
atcgtgctct 
gggcagcttc 
ggcacctga 



acaacgtccc 
gcatgccctt 
tgtgcaccct 
ccgccatggc 
ggaagccctc 
tcacccctgt 
gcatacgcct 
tggcctttgc 
tgacgtcctc 
cccgcacagc 
tacagctgac 
cggccatcag 
gtgagacgtt 
gcgctgtcag 



cgacatcttc 
catgatccac 
catcacggcc 
cattgaccgc 
tgtggccacc 
gtggctgtat 
gcccaaccca 
cctgcctttt 
agtggccccc 
catcgccatc 
ccagttgtcc 
cttgggctat 
ccgcaaacgc 
caacgctcag 



atcatcaacc 
cagctcatgg 
atggatgcca 
tacctggcca 
ctggtgatct 
gccagactca 
gacactgacc 
gtggtcatca 
gcctcccagc 
tgtctggtct 
atcagccgcc 
gccaacagct 
ttggtcctgt 
acggctgacg 



tctcggtagt 
gcaatggggt 
atagtcagtt 
ctgtccaccc 
gcctcctgtg 
tccccttccc 
tctactggtt 
cagccgcata 
gcagcatccg 
tctttgtgtg 
cgaccctcac 
gcctcaaccc 
cggtgaagcc 
aggagaggac 



agatctcctc 
gtggcacttt 
caccagcacc 
catctcttcc 
ggccctctcc 
aggaggtgca 
caccctgtac 
cgtgaggatc 
gctgcggaca 
ctgggcaccc 
ctttgtctac 
ctttgtgtac 
tgcagcccag 
agaaagcaaa 



<210> 16 
<211> 1062 
<212> DNA 
<213> Human 

<400> 16 
atggacctgg 
cccgataacc 
atcatcatgc 
gtcatcttcg 
ttcatcatca 
caccagctca 
gccatggatg 
cgctacctgg 
accctggtga 
tatgccagac 
ccagacactg 
tttgtggtca 
cccgcctccc 
atctgtctgg 
tccatcagcc 
tatgccaaca 
cgcttggtcc 
cagacggctg 

<210> 17 
<211> 2080 
<212> DNA 
<213> Mouse 



aagcctcgct 
tcacttcggc 
cttcggtgtt 
cggtcgtgaa 
acctctcggt 
tgggcaatgg 
ccaatagtca 
ccactgtcca 
tctgcctcct 
tcatcccctt 
acctctactg 
tcacagccgc 
agcgcagcat 
tcttctttgt 
gcccgaccct 
gctgcctcaa 
tgtcggtgaa 
acgaggagag 



gctgcccact 
aggatcacct 
cggcaccatc 
gaagtccaag 
agtagatctc 
ggtgtggcac 
gttcaccagc 
ccccatctct 
gtgggccctc 
cccaggaggt 
gttcaccctg 
atacgtgagg 
ccggctgcgg 
gtgctgggca 
cacctttgtc 
cccctttgtg 
gcctgcagcc 
gacagaaagc 



ggtcccaatg 
cctcgcacgg 
tgcctcctgg 
ctgcactggt 
ctctttctcc 
tttggggaga 
acctacatcc 
tccacgaagt 
tccttcatca 
gcagtgggct 
taccagtttt 
atcctgcagc 
acaaagaggg 
ccctactatg 
tacttataca 
tacatcgtgc 
caggggcagc 
aaaggcacct 



ccagcaacac 
ggagcatctc 
gcatcatcgg 
gcaacaacgt 
tgggcatgcc 
ccatgtgcac 
tgaccgccat 
tccggaagcc 
gcatcacccc 
gcggcatacg 
tcctggcctt 
gcatgacgtc 
tgacccgcac 
tgctacagct 
atgcggccat 
tctgtgagac 
ttcgcgctgt 
ga 



ctctgatggc 
ctacatcaac 
gaactccacg 
ccccgacatc 
cttcatgatc 
cctcatcacg 
ggccattgac 
ctctgtggcc 
tgtgtggctg 
cctgcccaac 
tgccctgcct 
ctcagtggcc 
agccatcgcc 
gacccagttg 
cagcttgggc 
gttccgcaaa 
cagcaacgct 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1269 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1062 



<220> 

<221> misc_feature 
<222> (1) • • • (2080) 
<223> n = A,T,C or G 



<400> 17 

ggcggtagag 

gggcgctgga 

cgacctgcac 

gcaacatctc 

gtgtctccta 

ttgtgggaaa 

gcaacgtccc 

gcatgccttt 

tgtgcaccct 

ctgctatggc 



gaagaccctt 
ggctgccgca 
cggctgcatg 
cgatggccag 
catcaacatc 
ctccacagtc 
tgacatcttc 
catgatccac 
catcacagcc 
cattgaccgc 



ttctggactg 
gcctgcgtgg 
gatctgcaag 
gataatttca 
atcatgcctt 
atttttgccg 
atcatcaacc 
cagctcatgg 
atggacgcca 
tacttggcca 



cggggctcaa 
gtggacgggc 
cctcgttgct 
cattggcggg 
cagtgtttgg 
tggtgaagaa 
tctctgtggt 
gtaatggtgt 
acagtcagtt 
ccgtccatcc 



gctccggaca 
gctccactcc 
gtccactggc 
gccacctcct 
taccatctgt 
atccaagctg 
ggatctgctt 
ctggcacttt 
caccagcacc 
catctcctcc 



aggcggtgga 
agggagcagg 
cccaatgcca 
cgcacaagga 
ctcctgggca 
cactggtgca 
ttcctgctgg 
ggggaaacca 
tacatcctga 
accaagttcc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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ggaagccctc 
tcactcctgt 
gcatccgcct 
tggccttcgc 
tgacgtcttc 
cccgcacagc 
tgcagctgac 
cggccatcag 
gtgagacctt 
gcacggtcag 
aatccccccc 
gataaacccg 
ccacattcca 
cctcagactg 
tgactggttc 
ggcggaggtg 
ggctctactg 
gaaggcagtc 
tcctgtgcgg 
gagcccacag 
gagaatgaca 
acagtgacag 
gagcatccat 
ctttgtgctt 
cccaaaaaag 



catggccacc 
gtggctctat 
accaaaccca 
ccttccgttt 
ggtggcccca 
cattgccatc 
ccagttgtcc 
cttgggctat 
tcgaaaacgc 
caatgctcag 
ggtcacctcc 
gggctaccct 
cggggcccac 
cagggatccc 
tcggggtctg 
cctagctggg 
tgagactggg 
tttctcccaa 
atttcacgtt 
gagtttaaaa 
aggcaacctg 
cgacattctc 
gaattctgca 
gagccttccn 
gggnaaaaaa 



ctggtgatct 
gccaggctta 
gatactgatc 
gtggtcatca 
gcctctcaac 
tgtctggtct 
atcagccgcc 
gccaacagct 
ttggtgctgt 
acagctgacg 
aagtcaggtc 
gggaggatgc 
aaattgctag 
cttgacagaa 
tatctgttgg 
ttcctttaaa 
caagccgagc 
gctggtggat 
ctctttaggg 
aaccaaaaga 
gggtggatgt 
ttcatagggc 
tgtgcagggg 
ctcccaaatg 
aaaaaaaaaa 



gcctcctgtg 
tccccttccc 
tttactggtt 
ctgctgcgta 
gcagcatccg 
tctttgtgtg 
cgaccctcac 
gcctcaatcc 
cggtgaagcc 
aggagaggac 
accgcatcaa 
agaagctgga 
ggaggcttgc 
tagaagcgga 
ctcgcatata 
actaggcagg 
gttccctccc 
ctcctgaagc 
gatgcatgtt 
gctcagagtg 
ggatcttgaa 
acagctgtca 
tcactctaat 
ggaatgaaat 
aaaaaaaaaa 



ggctctctcg 
agggggtgct 
cactctgtat 
cgtgaaaata 
gcttcggaca 
ctgggcgccc 
attcgtctac 
ctttgtgtac 
cgcggcccag 
agaaagcaaa 
accatgggga 
ggctgggggc 
agcctggttt 
gcaagaagga 
tctttctctc 
gctaggatct 
atctctcatt 
acgctgcctg 
gacactgggg 
tcgagagaga 
actaataaaa 
gtctatggct 
acctgatatg 
aaaggcaaat 



ttcattagca 
gtgggctgtg 
cagtttttcc 
ctacagcgca 
aagagggtga 
tactacgtgc 
ctgtacaatg 
atagtactct 
gggcagcttc 
ggcacctgac 
gagatactga 
ttgtagcaaa 

gggggggaag 

aagggtggtt 
aagggaagaa 
gagcagctag 
ggtgttgata 
ggctccagca 
tgtgggctct 
cccaatcacc 
aggggttttc 
gatccagagc 
ttggcatcat 
tcccnccccc 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2080 



<210> 18 
<211> 3357 
<212> DNA 
<213> Mouse 



<220> 

<221> misc_feature 
<222> (1) • - ■ (3357) 
<223> n = A,T,C or G 



<400> 18 

ggcggtagag 

gggcgctgga 

cgacctgcac 

gcaacatctc 

cctcctccgg 

gaaactttat 

gaggagagga 

aggcacattt 

cttgctgatg 

aagcaagcaa 

ctttgattca 

gagtggtcct 

agaggattaa 

aaattctagc 

attcccaagc 

ctcaggagac 

agcctgaaac 

atctgaaacc 

cacacacaca 

aagccttttt 

atctgggagt 

cctctgagta 

gggtaaggta 

gacgtagtga 

caggttgtca. 



gaagaccctt 
ggctgccgca 
cggctgcatg 
cgatggccag 
gatgggtgtg 
cttacagttc 
ggggacacca 
agaaacaaca 
agtcccaaat 
gcaagacagc 
ccgctatcct 
tgctgtcttc 
cttttctggt 
tctgaaggag 
ttctggggtg 
atggtccaag 
acaccccaca 
tttggccatc 
cacacacaca 
gacaaggttt 
cttgactcct 
acaggtggat 
cagaactaac 
ccccaagctg 
gcttctagat 



ttctggactg 
gcctgcgtgg 
gatctgcaag 
gataatttca 
gaaaatggga 
ttggtgataa 
gctaggacag 
agggttggtg 
atgctttgca 
aagacagcgg 
gggctcaatc 
tggcaaaggc 
gaattaagct 
aagggaatga 
taacacagct 
gaaatccctg 
cccatggtga 
tctctctgaa 
cacacacaca 
tctggtggat 
ggccctctga 
ctagaaaatc 
aatagcccga 
caagggaaag 
cggaaggcgg 



cggggctcaa 
gtggacgggc 
cctcgttgct 
cattggcggg 
aggtttcacc 
gatctgcagt 
aaggggcagg 
acaagacgtg 

gggggggggg 
gcgggtagta 
ctctggcctc 
tgctgggagc 
tcttgacatt 
aggggaaaga 
ccagtcccta 
acagaaaacc 
cagtcacttc 
aagatgaggc 
cacacacaca 
cccggggata 
gtaaatggat 
ctataggagt 
gaaggggaaa 
catgaggggc 
gtcacacttg 



gctccggaca 
gctccactcc 
gtccactggc 
tgagtcgagt 
tcccaagcca 
cggctttgcc 
gaggaataga 
aggcaggctt 
ggggggaatc 
tgtgggagcc 
gcactgggga 
aaaagacttc 
tgcagaacgt 
gggaaggttg 
ccctattggg 
gggagagggc 
tcacatatgc 
tgcaaataca 
cacacacaca 
tgaagttgtt 
gaagcgaaga 
caccagggca 
cagcaggaga 
cagcaggaag 
ctctttctat 



aggcggtgga 
agggagcagg 
cccaatgcca 
tggagtcctc 
aactgcctgg 
tgaagaggaa 
gatggggcag 
gaggggaaag 
aaggctggag 
agcagaagcg 
aatggggtct 
acagggcgtg 
caatgcctta 
gtgtggagaa 
aaagcccaga 
agggctgtgg 
ctaggaacct 
cacacacaca 
aatgtccttc 
ctcagcagat 
agaatggggt 
cggtggagga 
tgattccaga 
gccgacatgg 
cctcagggcc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



-16- 



WO 01/68706 



PCT/US01/08071 



acctcctcgc 
catctgtctc 
caagctgcac 
tctgcttttc 
gcactttggg 
cagcacctac 
ctcctccacc 
tctctcgttc 
gggtgctgtg 
tctgtatcag 
gaaaatacta 
tcggacaaag 
ggcgccctac 
cgtctacctg 
tgtgtacata 

ggcccagggg 
aagcaaaggc 
atggggagag 
tgggggcttg 
ctggtttggg 
agaaggaaag 
ttctctcaag 
aggatctgag 
tctcattggt 
ctgcctgggc 
actggggtgt 
agagagaccc 
aataaaaagg 
tatggctgat 
tgatatgttg 
ggcaaattcc 



acaaggagtg 
ctgggcattg 
tggtgcagca 
ctgctgggca 
gaaaccatgt 
atcctgactg 
aagttccgga 
attagcatca 
ggctgtggca 
tttttcctgg 
cagcgcatga 
agggtgaccc 
tacgtgctgc 
tacaatgcgg 
gtactctgtg 
cagcttcgca 
acctgacaat 
atactgagat 
tagcaaacca 
ggggaagcct 
ggtggtttga 
ggaagaaggc 
cagctagggc 
gttgatagaa 
tccagcatcc 
gggctctgag 
aatcaccgag 
ggttttcaca 
ccagagcgag 
gcatcatctt 
cncccccccc 



tctcctacat 
tgggaaactc 
acgtccctga 
tg'cctttcat 
gcaccctcat 
ctatggccat 
agccctccat 
ctcctgtgtg 
tccgcctacc 
ccttcgccct 
cgtcttcggt 
gcacagccat 
agctgaccca 
ccatcagctt 
agacctttcg 
cggtcagcaa 
cccccccggt 
aaacccgggg 
cattccacgg 
cagactgcag 
ctggttctcg 
ggaggtgcct 
tctactgtga 
ggcagtcttt 
tgtgcggatt 
cccacaggag 
aatgacaagg 
gtgacagcga 
catccatgaa 
tgtgcttgag 
aaaaaagggg 



caacatcatc 
cacagtcatt 
catcttcatc 
gatccaccag 
cacagccatg 
tgaccgctac 
ggccaccctg 
gctctatgcc 
aaacccagat 
tccgtttgtg 
ggccccagcc 
tgccatctgt 
gttgtccatc 
gggctatgcc 
aaaacgcttg 
tgctcagaca 
cacctccaag 
ctaccctggg 
ggcccacaaa 
ggatcccctt 
gggtctgtat 
agctgggttc 
gactgggcaa 
ctcccaagct 
tcacgttctc 
tttaaaaaac 
caacctgggg 
cattctcttc 
ttctgcatgt 
ccttccnctc 
naaaaaaaaa 



atgccttcag 
tttgccgtgg 
atcaacctct 
ctcatgggta 
gacgccaaca 
ttggccaccg 
gtgatctgcc 
aggcttatcc 
actgatcttt 
gtcatcactg 
tctcaacgca 
ctggtcttct 
agccgcccga 
aacagctgcc 
gtgctgtcgg 
gctgacgagg 
tcaggtcacc 
aggatgcaga 
ttgctaggga 
gacagaatag 
ctgttggctc 
ctttaaaact 
gccgagcgtt 
ggtggatctc 
tttaggggat 
caaaagagct 
tggatgtgga 
atagggcaca 
gcaggggtca 
ccaaatggga 
aaaaaaaaaa 



tgtttggtac 

tgaagaaatc 

ctgtggtgga 

atggtgtctg 

gtcagttcac 

tccatcccat 

tcctgtgggc 

ccttcccagg 

actggttcac 

ctgcgtacgt 

gcatccggct 

ttgtgtgctg 

ccctcacatt 

tcaatccctt 

tgaagcccgc 

agaggacaga 

gcatcaaacc 

agctggaggc 

ggcttgcagc 

aagcggagca 

gcatatatct 

aggcagggct 

ccctcccatc 

ctgaagcacg 

gcatgttgac 

cagagtgtcg 

tcttgaaact 

gctgtcagtc 

ctctaatacc 

atgaaataaa 

aaaaaaa 



<210> 19 
<211> 1062 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Human short form/mouse species chimeric MCH1R 



<400> 19 

atggacctgg 

cccgataacc 

atcatcatgc 

gtcatcttcg 

ttcatcatca 

caccagctca 

gccatggatg 

cgctacctgg 

accctggtga 

tatgccaggc 

ccagatactg 

tttgtggtca 

ccagcctctc 

atctgtctgg 

tccatcagcc 

tatgccaaca 

cgcttggtgc 

cagacagctg 

<210> 20 
<211> 1269 



aagcctcgct 
tcacttcggc 
cttcggtgtt 
cggtcgtgaa 
acctctcggt 
tgggcaatgg 
ccaatagtca 
ccactgtcca 
tctgcctcct 
ttatcccctt 
atctttactg 
tcactgctgc 
aacgcagcat 
tcttctttgt 
gcccgaccct 
gctgcctcaa 
tgtcggtgaa 
acgaggagag 



gctgcccact 
aggatcacct 
cggcaccatc 
gaagtccaag 
agtagatctc 
ggtgtggcac 
gttcaccagc 
ccccatctct 
gtgggctctc 
cccagggggt 
gttcactctg 
gtacgtgaaa 
ccggcttcgg 
gtgctgggcg 
cacattcgtc 
tccctttgtg 
gcccgcggcc 
gacagaaagc 



ggtcccaatg 
cctcgcacgg 
tgcctcctgg 
ctgcactggt 
ctctttctcc 
tttggggaga 
acctacatcc 
tccacgaagt 
tcgttcatta 
gctgtgggct 
tatcagtttt 
atactacagc 
acaaagaggg 
ccctactacg 
tacctgtaca 
tacatagtac 
caggggcagc 
aaaggcacct 



ccagcaacac 
ggagcatctc 
gcatcatcgg 
gcaacaacgt 
tgggcatgcc 
ccatgtgcac 
tgaccgccat 
tccggaagcc 
gcatcactcc 
gtggcatccg 
tcctggcctt 
gcatgacgtc 
tgacccgcac 
tgctgcagct 
atgcggccat 
tctgtgagac 
ttcgcacggt 
ga 



ctctgatggc 
ctacatcaac 
gaactccacg 
ccccgacatc 
cttcatgatc 
cctcatcacg 
ggccattgac 
ctccatggcc 
tgtgtggctc 
cctaccaaac 
cgcccttccg 
ttcggtggcc 
agccattgcc 
gacccagttg 
cagcttgggc 
ctttcgaaaa 
cagcaatgct 



1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3357 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1062 
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<212> DNA 

<213> Artificial Sequence 

<220> . 

<223> Human long form/mouse species chimeric MCH1R 



<400> 20 

atgtcagtgg 

ggctgccagg 

ggtggcaggc 

tgggagcagg 

cccaacgcca 

cgcacgggga 

ctcctgggca 

cactggtgca 

tttctcctgg 

ggggagacca 

tacatcctga 

acgaagttcc 

ttcattagca 

gtgggctgtg 

cagtttttcc 

ctacagcgca 

aagagggtga 

tactacgtgc 

ctgtacaatg 

atagtactct 

gggcagcttc 

ggcacctga 



gagccatgaa 
ctacggagga 
gctggaggct 
cgaccggcac 
gcaacacctc 
gcatctccta 
tcatcgggaa 
acaacgtccc 
gcatgccctt 
tgtgcaccct 
ccgccatggc 
ggaagccctc 
tcactcctgt 
gcatccgcct 
tggccttcgc 
tgacgtcttc 
cccgcacagc 
tgcagctgac 
cggccatcag 
gtgagacctt 
gcacggtcag 



gaagggagtg 
agaccccctt 
gccgcagcct 
tggctggatg 
tgatggcccc 
catcaacatc 
ctccacggtc 
cgacatcttc 
catgatccac 
catcacggcc 
cattgaccgc 
catggccacc 
gtggctctat 
accaaaccca 
ccttccgttt 
ggtggcccca 
cattgccatc 
ccagttgtcc 
cttgggctat 
tcgaaaacgc 
caatgctcag 



gggagggcag 
cccaactgcg 
gcgtgggtgg 
gacctggaag 
gataacctca 
atcatgcctt 
atcttcgcgg 
atcatcaacc 
cage tea tgg 
atggatgcca 
tacctggcca 
ctggtgatct 
gecaggctta 
gatactgatc 
gtggtcatca 
gcctctcaac 
tgtctggtct 
atcagccgcc 
gccaacagct 
ttggtgctgt 
acagctgacg 



ttgggcttgg 
gggcttgege 
aggggagctc 
cctcgctgct 
etteggcagg 
cggtgttcgg 
tcgtgaagaa 
teteggtagt 
gcaatggggt 
atagtcagtt 
ctgtccaccc 
gcctcctgtg 
tccccttccc 
tttactggtt 
ctgctgcgta 
gcagcatccg 
tctttgtgtg 
cgaccctcac 
gcctcaatcc 
eggtgaagee 
aggagaggac 



aggeggcage 
teegggacaa 
agctcggttg 
gcccactggt 
atcacctcct 
caccatctgc 
gtccaagctg 
agatctcctc 
gtggcacttt 
caccagcacc 
catctcttcc 
ggctctctcg 
agggggtget 
cactctgtat 
cgtgaaaata 
getteggaca 
ctgggcgccc 
attegtctae 
ctttgtgtac 
cgcggcccag 
agaaagcaaa 



<210> 21 
<211> 966 
<212> DNA 

<213> Aeguorea Victoria 



<400> 21 

tacacacgaa 

cccaattctt 

gggtgaaggt 

actacctgtt 

aagataccca 

tgtacaggaa 

caagtttgaa 

agatggaaac 

catggcagac 

agatggaagc 

tgtcctttta 

cgaaaagaga 

catggatgaa 

attactaaaa 

agattcatta 

taaacaggct 

ctcaaa 



taaaagataa 
gttgaattag 
gatgeaacat 
ccatggccaa 
gatcatatga 
agaactatat 
ggtgataccc 
attcttggac 
aaacaaaaga 
gttcaactag 
ccagacaacc 
gaccacatgg 
ctatacaaat 
tctcagggtt 
aaattgtatg 
acttggagtg 



caaagatgag 
atggtgatgt 
aeggaaaact 
cacttgtcac 
aacagcatga 
ttttcaaaga 
ttgttaatag 
acaaattgga 
atggaatcaa 
cagaccatta 
attacctgtc 
tccttcttga 
aaatgtccag 
cctggttaaa 
aataatttat 
tattcttaat 



taaaggagaa 
taatgggcac 
tacccttaaa 
tactttctct 
ctttttcaag 
tgaegggaac 
aatcgagtta 
atacaactat 
agttaacttc 
tcaacaaaat 
cacacaatct 
gtttgtaaca 
acttccaatt 
ttcaggctga 
tgatgttatt 
tctatattaa 



gaacttttca 
aaattttctg 
tttatttgea 
tatggtgttc 
agtgccatgc 
tacaagacac 
aaaggtattg 
aactcacaca 
aaaattagac 
actccaattg 
gccctttcga 
gctgctggga 
gacactaaag 
gatattattt 
gatagaggtt 
ttacaatttg 



ctggagttgt 
tcagtggaga 
ctactggaaa 
aatgetttte 
ccgaaggtta 
gtgctgaagt 
attttaaaga 
atgtatacat 
acaacattga 
gcgatggccc 
aagatcccaa 
ttacacatgg 
tgtccgaaca 
atatatttat 
attttcttat 
atttgacttg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1269 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
966 



<210> 22 
<211> 765 . 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> GFP derivative 
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qtcqacggta ccgcgggccc gggatccatc gccaccatgg tgagcaaggg cgaggagctg 60 

ttcaccgggg tggtgcccat cctggtcgag ctggacggcg acgtaaacgg ccacaagttc 120 

aacqtgtccg gcgagggcga gggcgatgcc acctacggca agctgaccct gaagttcatc 180 

tacaccaccg gcaagctgcc cgtgccctgg cccaccctcg tgaccaccct gacctacggc 240 

crtacagtgct tcagccgcta ccccgaccac atgaagcagc acgacttctt caagtccgcc 300 

atacccgaag gctacgtcca ggagcgcacc atcttcttca aggacgacgg caactacaag 360 

acccacgccg aggtgaagtt cgagggcgac accctggtga accgcatcga gctgaagggc 420 

atcqacttca aggaggacgg caacatcctg gggcacaagc tggagtacaa ctacaacagc 480 

cacaacgtct atatcatggc cgacaagcag aagaacggca tcaaggtgaa cttcaagatc 540 

caccacaaca tcgaggacgg cagcgtgcag ctcgccgacc actaccagca gaacaccccc 600 

atcaacgacg gccccgtgct gctgcccgac aaccactacc tgagcaccca gtccgccctg 660 

agcaaagacc ccaacgagaa gcgcgatcac atggtcctgc tggagttcgt gaccgccgcc 720 

gggatcactc tcggcatgga cgagctgtac aagtaaagcg gccgc 765 

<210> 23 
<211> 720 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> GFP derivative 

atggtgagca agggcgagga gctgttcacc ggggtggtgc ccatcctggt cgagctggac 60 

aacaacqtaa acggccacaa gttcagcgtg tccggcgagg gcgagggcga tgccacctac 120 

aacaagctga ccctgaagtt catctgcacc accggcaagc tgcccgtgcc ctggcccacc 180 

ctcgtgacca ccttgaccta cggcgtgcag tgcttcgccc gctaccccga ccacatgaag 240 

caqcacgact tcttcaagtc cgccatgccc gaaggctacg tccaggagcg caccatcttc 300 

ttcaaggacg acggcaacta caagacccgc gccgaggtga agttcgaggg cgacaccctg 360 

gtgaaccgca tcgagctgaa gggcatcgac ttcaaggagg acggcaacat cctggggcac 420 

aaactggagt acaactacaa cagccacaag gtctatatca ccgccgacaa gcagaagaac 480 

qqcatcaagg tgaacttcaa gacccgccac aacatcgagg acggcagcgt gcagctcgcc 540 

aaccactacc agcagaacac ccccatcggc gacggccccg tgctgctgcc cgacaaccac 600 

tacctgagca cccagtccgc cctgagcaaa gaccccaacg agaagcgcga tcacatggtc 660 

ctgctggagt tcgtgaccgc cgccgggatc actctcggca tggacgagct gtacaagtaa 720 

<210> 24 
<211> 720 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> GFP derivative 

atggtgagca agggcgagga gctgttcacc ggggtggtgc ccatcctggt cgagctggac 60 

aacqacgtaa acggccacaa gttcagcgtg tccggcgagg gcgagggcga tgccacctac 120 

qqcaagctga ccctgaagtt catctgcacc accggcaagc tgcccgtgcc ctggcccacc 180 

ctcgtgacca ccttcggcta cggcgtgcag tgcttcgccc gctaccccga ccacatgcgc 240 

caqcacgact. tcttcaagtc cgccatgccc gaaggctacg tccaggagcg caccatcttc 300 

ttcaaggacg acggcaacta caagacccgc gccgaggtga agttcgaggg cgacaccctg 360 

qtaaaccgca tcgagctgaa gggcatcgac ttcaaggagg acggcaacat cctggggcac 420 

aaactggagt acaactacaa cagccacaac gtctatatca tggccgacaa gcagaagaac 480 

qqcatcaagg tgaacttcaa gatccgccac aacatcgagg acggcagcgt gcagctcgcc 540 

aaccactacc agcagaacac ccccatcggc gacggccccg tgctgctgcc cgacaaccac 600 

tacctgagct accagtccgc cctgagcaaa gaccccaacg agaagcgcga tcacatggtc 660 

ctgctggagt tcgtgaccgc cgccgggatc actctcggca tggacgagct gtacaagtaa 720 

<210> 25 
<211> 720 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> GFP derivative 



<400> 25 

atggtgagca 

ggcgacgtaa 

ggcaagctga 

ctcgtgacca 

cagcacgact 

ttcaaggacg 

gtgaaccgca 

aagctggagt 

ggcatcaagg 

gaccactacc 

tacctgagca 

ctgctggagt 



agggcgagga 
acggccacag 
ccctgaagtt 
ccctgacctg 
tcttcaagtc 
acggcaacta 
tcgagctgaa 
acaactacat 
cccacttcaa 
agcagaacac 
cccagtccgc 
tcgtgaccgc 



gctgttcacc 
gttcagcgtg 
catctgcacc 
gggcgtgcag 
cgccatgccc 
caagacccgc 
gggcatcgac 
cagccacaac 
gatccgccac 
ccccatcggc 
cctgagcaaa 
cgccgggatc 



ggggtggtgc 
tccggcgagg 
accggcaagc 
tgcttcagcc 
gaaggctacg 
gccgaggtga 
ttcaaggagg 
gtctatatca 
aacatcgagg 
gacggccccg 
gaccccaacg 
actctcggca 



ccatcctggt 
gcgagggcga 
tgcccgtgcc 
gctaccccga 
tccaggagcg 
agttcgaggg 
acggcaacat 
ccgccgacaa 
acggcagcgt 
tgctgctgcc 
agaagcgcga 
tggacgagct 



<210> 26 
<211> 3092 
<212> DNA 

<213> Artificial Sequence 



cgagctggac 
tgccacctac 
ctggcccacc 
ccacatgaag 
taccatcttc 
cgacaccctg 
cctggggcac 
gcagaagaac 
gcagctcgcc 
cgacaaccac 
tcacatggtc 
gtacaagtaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



<220> 

<223> Mouse MCHlR-linker-EGFP 



<400> 26 

atggatctgc 

caggataatt 

gtggaaaatg 

ttcttggtga 

ccagctagga 

acaagggttg 

aatacgcttt 

agcaagacag 

cctgggctca 

ttctggcaaa 

ggtgaattaa 

gagaagggaa 

gtgtaacaca 

aaggaaatcc 

acacccatgg 

atctctctct 

acacacacac 

ttttctggtg 

cctggccctc 

gatctagaaa 

aacaatagcc 

ctgcaaggga 

gatcggaagg 

gtgtctccta 

ttgtgggaaa 

gcaacgtccc 

gcatgccttt 

tgtgcaccct 

ctgctatggc 

ggaagccctc 

tcactcctgt 

gcatccgcct 

tggccttcgc 

tgacgtcttc 



aagcctcgtt 
tcacattggc 
ggaaggtttc 
taagatctgc 
cagaaggggc 
gtgacaagac 
gcaggggggg 
cgggcgggta 
atcctctggc 
ggctgctggg 
gcttcttgac 
tgaaggggaa 
gctccagtcc 
ctgacagaaa 
tgacagtcac 
gaaaagatga 
acacacacac 
gatcccgggg 
tgagtaaatg 
atcctatagg 
cgagaagggg 
aagcatgagg 
cgggtcacac 
catcaacatc 
ctccacagtc 
tgacatcttc 
catgatccac 
catcacagcc 
cattgaccgc 
catggccacc 
gtggctctat 
accaaaccca 
ccttccgttt 
ggtggcccca 



gctgtccact 
gggtgagtcg 
acctcccaag 
agtcggcttt 
agggaggaat 
gtgaggcagg 

ggggggggga 

gtatgtggga 
ctcgcactgg 
agcaaaagac 
atttgcagaa 
agagggaagg 
ctaccctatt 
accgggagag 
ttctcacata 
ggctgcaaat 
acacacacac 
atatgaagtt 
gatgaagcga 
agtcaccagg 
aaacagcagg 
ggccagcagg 
ttgctctttc 
atcatgcctt 
atttttgccg 
atcatcaacc 
cagctcatgg 
atggacgcca 
tacttggcca 
ctggtgatct 
gccaggctta 
gatactgatc 
gtggtcatca 
gcctctcaac 



ggccccaatg 
agttggagtc 
ccaaactgcc 
gcctgaagag 
agagatgggg 
cttgagggga 
atcaaggctg 
gccagcagaa 
ggaaatgggg 
ttcacagggc 
cgtcaatgcc 
ttggtgtgga 
gggaaagccc 
ggcagggctg 
tgcctaggaa 
acacacacac 
acaaatgtcc 
gttctcagca 
agaagaatgg 
gcacggtgga 
agatgattcc 
aaggccgaca 
tatcctcagg 
cagtgtttgg 
tggtgaagaa 
tctctgtggt 
gtaatggtgt 
acagtcagtt 
ccgtccatcc 
gcctcctgtg 
tccccttccc 
tttactggtt 
ctgctgcgta 
gcagcatccg 



ccagcaacat 
ctccctcctc 
tgggaaactt 
gaagaggaga 
cagaggcaca 
aagcttgctg 
gagaagcaag 
gcgctttgat 
tctgagtggt 
gtgagaggat 
ttaaaattct 
gaaattccca 
agactcagga 
tggagcctga 
cctatctgaa 
acacacacac 
ttcaagcctt 
gatatctggg 
ggtcctctga 
ggagggtaag 
agagacgtag 
tggcaggttg 
gccacctcct 
taccatctgt 
atccaagctg 
ggatctgctt 
ctggcacttt 
caccagcacc 
catctcctcc 
ggctctctcg 
agggggtgct 
cactctgtat 
cgtgaaaata 
gcttcggaca 



ctccgatggc 

cgggatgggt 

tatcttacag 

ggaggggaca 

tttagaaaca 

atgagtccca 

caagcaagac 

tcaccgctat 

ccttgctgtc 

taacttttct 

agctctgaag 

agcttctggg 

gacatggtcc 

aacacacccc 

acctttggcc 

acacacacac 

tttgacaagg 

agtcttgact 

gtaacaggtg 

gtacagaact 

tgaccccaag 

tcagcttcta 

cgcacaagga 

ctcctgggca 

cactggtgca 

ttcctgctgg 

ggggaaacca 

tacatcctga 

accaagttcc 

ttcattagca 

gtgggctgtg 

cagtttttcc 

ctacagcgca 

aagagggtga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
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cccgcacagc 
tgcagctgac 
cggccatcag 
gtgagacctt 
gcacggtcag 
acggtaccgc 
ccggggtggt 
tgtccggcga 
ccaccggcaa 
agtgcttcag 
ccgaaggcta 
gcgccgaggt 
acttcaagga 
acgtctatat 
acaacatcga 
gcgacggccc 
aagaccccaa 
tcactctcgg 



cattgccatc 
ccagttgtcc 
cttgggctat 
tcgaaaacgc 
caatgctcag 
gggcccggga 
gcccatcctg 
gggcgagggc 
gctgcccgtg 
ccgctacccc 
cgtccaggag 
gaagttcgag 
ggacggcaac 
catggccgac 
ggacggcagc 
cgtgctgctg 
cgagaagcgc 
catggacgag 



tgtctggtct 
atcagccgcc 
gccaacagct 
ttggtgctgt 
acagctgacg 
tccatcgcca 
gtcgagctgg 
gatgccacct 
ccctggccca 
gaccacatga 
cgcaccatct 
ggcgacaccc 
atcctggggc 
aagcagaaga 
gtgcagctcg 
cccgacaacc 
gatcacatgg 
ctgtacaagt 



tctttgtgtg 
cgaccctcac 
gcctcaatcc 
cggtgaagcc 
aggagaggac 
ccatggtgag 
acggcgacgt 
acggcaagct 
ccctcgtgac 
agcagcacga 
tcttcaagga 
tggtgaaccg 
acaagctgga 
acggcatcaa 
ccgaccacta 
actacctgag 
tcctgctgga 
aa 



ctgggcgccc 
attcgtctac 
ctttgtgtac 
cgcggcccag 
agaaagcaaa 
caagggcgag 
aaacggccac 
gaccctgaag 
caccctgacc 
cttcttcaag 
cgacggcaac 
catcgagctg 
gtacaactac 
ggtgaacttc 
ccagcagaac 
cacccagtcc 
gttcgtgacc 



tactacgtgc 
ctgtacaatg 
atagtactct 
gggcagcttc 
ggcaccgtcg 
gagctgttca 
aagttcagcg 
ttcatctgca 
tacggcgtgc 
tccgccatgc 
tacaagaccc 
aagggcatcg 
aacagccaca 
aagatccgcc 
acccccatcg 
gccctgagca 
gccgccggga 



2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3092 



<210> 27 

<211> 3056 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Mouse MCH1R/EGFP direct fusion 



<400> 27 

atggatctgc 

caggataatt 

gtggaaaatg 

ttcttggtga 

ccagctagga 

acaagggttg 

aatatgcttt 

agcaagacag 

cctgggctca 

ttctggcaaa 

ggtgaattaa 

gagaagggaa 

gtgtaacaca 

aaggaaatcc 

acacccatgg 

atctctctct 

acacacacac 

ttttctggtg 

cctggccctc 

gatctagaaa 

aacaatagcc 

ctgcaaggga 

gatcggaagg 

gtgtctccta 

ttgtgggaaa 

gcaacgtccc 

gcatgccttt 

tgtgcaccct 

ctgctatggc 

ggaagccctc 

tcactcctgt 

gcatccgcct 

tggccttcgc 

tgacgtcttc 



aagcctcgtt 
tcacattggc 
ggaaggtttc 
taagatctgc 
cagaaggggc 
gtgacaagac 
gcaggggggg 
cgggcgggta 
atcctctggc 
ggctgctggg 
gcttcttgac 
tgaaggggaa 
gctccagtcc 
ctgacagaaa 
tgacagtcac 
gaaaagatga 
acacacacac 
gatcccgggg 
tgagtaaatg 
atcctatagg 
cgagaagggg 
aagcatgagg 
cgggtcacac 
catcaacatc 
ctccacagtc 
tgacatcttc 
catgatccac 
catcacagcc 
cattgaccgc 
catggccacc 
gtggctctat 
accaaaccca 
ccttccgttt 
ggtggcccca 



gctgtccact 
gggtgagtcg 
acctcccaag 
agtcggcttt 
agggaggaat 
gtgaggcagg 

ggggggggga 

gtatgtggga 
ctcgcactgg 
agcaaaagac 
atttgcagaa 
agagggaagg 
ctaccctatt 
accgggagag 
ttctcacata 
ggctgcaaat 
acacacacac 
atatgaagtt 
gatgaagcga 
agtcaccagg 
aaacagcagg 
ggccagcagg 
ttgctctttc 
atcatgcctt 
atttttgccg 
atcatcaacc 
cagctcatgg 
atggacgcca 
tacttggcca 
ctggtgatct 
gccaggctta 
gatactgatc 
gtggtcatca 
gcctctcaac 



ggccccaatg 
agttggagtc 
ccaaactgcc 
gcctgaagag 
agagatgggg 
cttgagggga 
atcaaggctg 
gccagcagaa 
ggaaatgggg 
ttcacagggc 
cgtcaatgcc 
ttggtgtgga 
gggaaagccc 
ggcagggctg 
tgcctaggaa 
acacacacac 
acaaatgtcc 
gttctcagca 
agaagaatgg 
gcacggtgga 
agatgattcc 
aaggccgaca 
tatcctcagg 
cagtgtttgg 
tggtgaagaa 
tctctgtggt 
gtaatggtgt 
acagtcagtt 
ccgtccatcc 
gcctcctgtg 
tccccttccc 
tttactggtt 
ctgctgcgta 
gcagcatccg 



ccagcaacat 
ctccctcctc 
tgggaaactt 
gaagaggaga 
cagaggcaca 
aagcttgctg 
gagaagcaag 
gcgctttgat 
tctgagtggt 
gtgagaggat 
ttaaaattct 
gaaattccca 
agactcagga 
tggagcctga 
cctatctgaa 
acacacacac 
ttcaagcctt 
gatatctggg 
ggtcctctga 
ggagggtaag 
agagacgtag 
tggcaggttg 
gccacctcct 
taccatctgt 
atccaagctg 
ggatctgctt 
ctggcacttt 
caccagcacc 
catctcctcc 
ggctctctcg 
agggggtgct 
cactctgtat 
cgtgaaaata 
gcttcggaca 



ctccgatggc 

cgggatgggt 

tatcttacag 

ggaggggaca 

tttagaaaca 

atgagtccca 

caagcaagac 

tcaccgctat 

ccttgctgtc 

taacttttct 

agctctgaag 

agcttctggg 

gacatggtcc 

aacacacccc 

acctttggcc 

acacacacac 

tttgacaagg 

agtcttgact 

gtaacaggtg 

gtacagaact 

tgaccccaag 

tcagcttcta 

cgcacaagga 

ctcctgggca 

cactggtgca 

ttcctgctgg 

ggggaaacca 

tacatcctga 

accaagttcc 

ttcattagca 

gtgggctgtg 

cagtttttcc 

ctacagcgca 

aagagggtga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
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cccgcacagc 
tgcagctgac 
cggccatcag 
gtgagacctt 
gcacggtcag 
tgagcaaggg 
acgtaaacgg 
agctgaccct 
tgaccaccct 
acgacttctt 
aggacgacgg 
accgcatcga 
tggagtacaa 
tcaaggtgaa 
actaccagca 
tgagcaccca 
tggagttcgt 



cattgccatc 
ccagttgtcc 
cttgggctat 
tcgaaaacgc 
caatgctcag 
cgaggagctg 
ccacaagttc 
gaagttcatc 
gacctacggc 
caagtccgcc 
caactacaag 
gctgaagggc 
ctacaacagc 
cttcaagatc 
gaacaccccc 
gtccgccctg 
gaccgccgcc 



tgtctggtct 
atcagccgcc 
gccaacagct 
ttggtgctgt 
acagctgacg 
ttcaccgggg 
agcgtgtccg 
tgcaccaccg 
gtgcagtgct 
atgcccgaag 
acccgcgccg 
atcgacttca 
cacaacgtct 
cgccacaaca 
atcggcgacg 
agcaaagacc 
gggatcactc 



tctttgtgtg 
cgaccctcac 
gcctcaatcc 
cggtgaagcc 
aggagaggac 
tggtgcccat 
gcgagggcga 
gcaagctgcc 
tcagccgcta 
gctacgtcca 
aggtgaagtt 
aggaggacgg 
atatcatggc 
tcgaggacgg 
gccccgtgct 
ccaacgagaa 
tcggcatgga 



ctgggcgccc 
attcgtctac 
ctttgtgtac 
cgcggcccag 
agaaagcaaa 
cctggtcgag 
gggcgatgcc 
cgtgccctgg 
ccccgaccac 
ggagcgcacc 
cgagggcgac 
caacatcctg 
cgacaagcag 
cagcgtgcag 
gctgcccgac 
gcgcgatcac 
cgagctgtac 



tactacgtgc 
ctgtacaatg 
atagtactct 
gggcagcttc 
ggcaccatgg 
ctggacggcg 
acctacggca 
cccaccctcg 
atgaagcagc 
atcttcttca 
accctggtga 
gggcacaagc 
aagaacggca 
ctcgccgacc 
aaccactacc 
atggtcctgc 
aagtaa 



<210> 28 
<211> 1815 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Human short form/mouse species chimeric 
MCHlR-linker-EGFP 



<400> 28 

atggacctgg 

cccgataacc 

atcatcatgc 

gtcatcttcg 

ttcatcatca 

caccagctca 

gccatggatg 

cgctacctgg 

accctggtga 

tatgccaggc 

ccagatactg 

tttgtggtca 

ccagcctctc 

atctgtctgg 

tccatcagcc 

tatgccaaca 

cgcttggtgc 

cagacagctg 

ggatccatcg 

ctggtcgagc 

ggcgatgcca 

gtgccctggc 

cccgaccaca 

gagcgcacca 

gagggcgaca 

aacatcctgg 

gacaagcaga 

agcgtgcagc 

ctgcccgaca 

cgcgatcaca 

gagctgtaca 

<210> 29 
<211> 2022 



aagcctcgct 
tcacttcggc 
cttcggtgtt 
cggtcgtgaa 
acctctcggt 
tgggcaatgg 
ccaatagtca 
ccactgtcca 
tctgcctcct 
ttatcccctt 
atctttactg 
tcactgctgc 
aacgcagcat 
tcttctttgt 
gcccgaccct 
gctgcctcaa 
tgtcggtgaa 
acgaggagag 
ccaccatggt 
tggacggcga 
cctacggcaa 
ccaccctcgt 
tgaagcagca 
tcttcttcaa 
ccctggtgaa 
ggcacaagct 
agaacggcat 
tcgccgacca 
accactacct 
tggtcctgct 
agtaa 



gctgcccact 
aggatcacct 
cggcaccatc 
gaagtccaag 
agtagatctc 
ggtgtggcac 
gttcaccagc 
ccccatctct 
gtgggctctc 
cccagggggt 
gttcactctg 
gtacgtgaaa 
ccggcttcgg 
gtgctgggcg 
cacattcgtc 
tccctttgtg 
gcccgcggcc 
gacagaaagc 
gagcaagggc 
cgtaaacggc 
gctgaccctg 
gaccaccctg 
cgacttcttc 
ggacgacggc 
ccgcatcgag 
ggagtacaac 
caaggtgaac 
ctaccagcag 
gagcacccag 
ggagttcgtg 



ggtcccaatg 
cctcgcacgg 
tgcctcctgg 
ctgcactggt 
ctctttctcc 
tttggggaga 
acctacatcc 
tccacgaagt 
tcgttcatta 
gctgtgggct 
tatcagtttt 
atactacagc 
acaaagaggg 
ccctactacg 
tacctgtaca 
tacatagtac 
caggggcagc 
aaaggcaccg 
gaggagctgt 
cacaagttca 
aagttcatct 
acctacggcg 
aagtccgcca 
aactacaaga 
ctgaagggca 
tacaacagcc 
ttcaagatcc 
aacaccccca 
tccgccctga 
accgccgccg 



ccagcaacac 

ggagcatctc 

gcatcatcgg 

gcaacaacgt 

tgggcatgcc 

ccatgtgcac 

tgaccgccat 

tccggaagcc 

gcatcactcc 

gtggcatccg 

tcctggcctt 

gcatgacgtc 

tgacccgcac 

tgctgcagct 

atgcggccat 

tctgtgagac 

ttcgcacggt 

tcgacggtac 

tcaccggggt 

gcgtgtccgg 

gcaccaccgg 

tgcagtgctt 

tgcccgaagg 

cccgcgccga 

tcgacttcaa 

acaacgtcta 

gccacaacat 

tcggcgacgg 

gcaaagaccc 

ggatcactct 



ctctgatggc 

ctacatcaac 

gaactccacg 

ccccgacatc 

cttcatgatc 

cctcatcacg 

ggccattgac 

ctccatggcc 

tgtgtggctc 

cctaccaaac 

cgcccttccg 

ttcggtggcc 

agccattgcc 

gacccagttg 

cagcttgggc 

ctttcgaaaa 

cagcaatgct 

cgcgggcccg. 

ggtgcccatc 

cgagggcgag 

caagctgccc 

cagccgctac 

ctacgtccag 

ggtgaagttc 

ggaggacggc 

tatcatggcc 

cgaggacggc 

ccccgtgctg 

caacgagaag 

cggcatggac 



2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3056 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1815 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Human long form/mouse species chimeric 
MCH1R-1 inker- EGFP 



<400> 29 

atgtcagtgg 

ggctgccagg 

ggtggcaggc 

tgggagcagg 

cccaacgcca 

cgcacgggga 

ctcctgggca 

cactggtgca 

tttctcctgg 

ggggagacca 

tacatcctga 

acgaagttcc 

ttcattagca 

gtgggctgtg 

cagtttttcc 

ctacagcgca 

aagagggtga 

tactacgtgc 

ctgtacaatg 

atagtactct 

gggcagcttc 

ggcaccgtcg 

gagctgttca 

aagttcagcg 

ttcatctgca 

tacggcgtgc 

tccgccatgc 

tacaagaccc 

aagggcatcg 

aacagccaca 

aagatccgcc 

acccccatcg 

gccctgagca 

gccgccggga 



gagccatgaa 
ctacggagga 
gctggaggct 
cgaccggcac 
gcaacacctc 
gcatctccta 
tcatcgggaa 
acaacgtccc 
gcatgccctt 
tgtgcaccct 
ccgccatggc 
ggaagccctc 
tcactcctgt 
gcatccgcct 
tggccttcgc 
tgacgtcttc 
cccgcacagc 
tgcagctgac 
cggccatcag 
gtgagacctt 
gcacggtcag 
acggtaccgc 
ccggggtggt 
tgtccggcga 
ccaccggcaa 
agtgcttcag 
ccgaaggcta 
gcgccgaggt 
acttcaagga 
acgtctatat 
acaacatcga 
gcgacggccc 
aagaccccaa 
tcactctcgg 



gaagggagtg 
agaccccctt 
gccgcagcct 
tggctggatg 
tgatggcccc 
catcaacatc 
ctccacggtc 
cgacatcttc 
catgatccac 
catcacggcc 
cattgaccgc 
catggccacc 
gtggctctat 
accaaaccca 
ccttccgttt 
ggtggcccca 
cattgccatc 
ccagttgtcc 
cttgggctat 
tcgaaaacgc 
caatgctcag 
gggcccggga 
gcccatcctg 
gggcgagggc 
gctgcccgtg 
ccgctacccc 
cgtccaggag 
gaagttcgag 
ggacggcaac 
catggccgac 
ggacggcagc 
cgtgctgctg 
cgagaagcgc 
catggacgag 



gggagggcag 
cccaactgcg 
gcgtgggtgg 
gacctggaag 
gataacctca 
atcatgcctt 
atcttcgcgg 
atcatcaacc 
cagctcatgg 
atggatgcca 
tacctggcca 
ctggtgatct 
gccaggctta 
gatactgatc 
gtggtcatca 
gcctctcaac 
tgtctggtct 
atcagccgcc 
gccaacagct 
ttggtgctgt 
acagctgacg 
tccatcgcca 
gtcgagctgg 
gatgccacct 
ccctggccca 
gaccacatga 
cgcaccatct 
ggcgacaccc 
atcctggggc 
aagcagaaga 
gtgcagctcg 
cccgacaacc 
gatcacatgg 
ctgtacaagt 



ttgggcttgg 
gggcttgcgc 
aggggagctc 
cctcgctgct 
cttcggcagg 
cggtgttcgg 
tcgtgaagaa 
tctcggtagt 
gcaatggggt 
atagtcagtt 
ctgtccaccc 
gcctcctgtg 
tccccttccc 
tttactggtt 
ctgctgcgta 
gcagcatccg 
tctttgtgtg 
cgaccctcac 
gcctcaatcc 
cggtgaagcc 
aggagaggac 
ccatggtgag 
acggcgacgt 
acggcaagct 
ccctcgtgac 
agcagcacga 
tcttcaagga 
tggtgaaccg 
acaagctgga 
acggcatcaa 
ccgaccacta 
actacctgag 
tcctgctgga 
aa 



<210> 30 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Linker 
<400> 30 

tcgacggtac cgcgggcccg ggatccatcg ccacc 

<210> 31 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Linker 



aggcggcagc 

tccgggacaa 

agctcggttg 

gcccactggt 

atcacctcct 

caccatctgc 

gtccaagctg 

agatctcctc 

gtggcacttt 

caccagcacc 

catctcttcc 

ggctctctcg 

agggggtgct 

cactctgtat 

cgtgaaaata 

gcttcggaca 

ctgggcgccc 

attcgtctac 

ctttgtgtac 

cgcggcccag 

agaaagcaaa 

caagggcgag 

aaacggccac 

gaccctgaag 

caccctgacc 

cttcttcaag 

cgacggcaac 

catcgagctg 

gtacaactac 

ggtgaacttc 

ccagcagaac 

cacccagtcc 

gttcgtgacc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2022 
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<400> 31 

Val Asp Gly Thr Ala Gly Pro Gly Ser lie Ala Thr 
1 5 10 

<210> 32 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 

<400> 32 

gcgaattcac catggatctg caagcctcg 

<210> 33 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 33 

gcgtcgacgg tgcctttgct ttctgtcc ^ 

<210> 34 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 34 

ccttgctcac catggtgcct ttgctttctg tec 

<210> 35 \ 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 35 

cagaaagcaa aggcaccatg gtgagcaagg gegaggage Jy 

<210> 36 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 36 

ggeggatect etagagtege ggee 

<210> 37 
<211> 38 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 37 

gctctagagt cgcggccgct tacttgtaca gctcgtcc 
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